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NOVEL MOLECULES 

FIELD OF THE INVENTION 

5 The present invention related generally to novel molecules and more particularly novel 
proteinaceous molecules involved in or associated with regulation of cell activities and/or 
viability. The present invention is particularly directed to novel serine proteinases and a novel 
kmase and to derivatives, agonists and antagonists thereof. In one embodiment, the present 
invention provides a novel serine proteinase, referred to herein as "HELAr or "testisin". which 

10 has roles in spermatogenesis, in suppressing testicular cancer and as a maricer for cancers. 

BACKGROUND OF THE INVENTION 

The rapidly increasing sophistication of recombinant DNA technology is greatly faciUtating 
15 research and development in the medical and allied health fields. This is particularly the case in 
the area of ceU regulation leading to a greater understandmg of the events leading to or involved 
in cancer, development of acquired immunodeficiency disease syndrome (AIDS), neurological 
disorders, heart disease, tissue graft rejection and infertUity amongst many other conditions. 

20 Two particularly important classes of molecules are the proteinases and kinases. 

Proteinases play important roles in a number of physiological and pathological processes such 
as proteolytic cascades involved in blood coagulation, fibrmolysis and complement activation as 
weU as deavage of growth factors, hormones and receptors, the release of bioactive molecules 
25 and processes involving cell proliferation and development, inflammation, tumour growth and 
metastasis. Of particular significance are the cellular protemases. or those proteinases 
synthesized in cells and tissues which serve to activate or deactivate proteins responsible for 
performing specific fiinctions. These proteinases may be found outside the cell, within the cell 
or may be present on the cell surface. 

30 

Serine proteinases are particularly important. These proteinases are characterised by a 
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mechanism involving serine, histidine and aspartate amino acids in the serine proteinase active 
site. Members of the serine proteinase family which play important roles in a range of cdlular 
functions and which have demonstrated causative roles in human diseases include tissue-type 
plasminogen activator and thrombin (thrombosis and blood clotting), urokinase-type plasminogen 
5 activator (cancer and metastasis), tiypsin and elastase (emphysema and Uver disease) and 
angiotrasin converting enzyme (hypertension). 

A serine proteinase is also impUcated in TNFa degradation and soluble TNF-receptor (p75) 
release by THPl cells (Vey et at Eur. J. Imm. 26. 2404-2409, 1996). Serine proteinases have 

10 been inplicated in the activation of macrophages (Nakabo et ai J. Leukocyte Biol. 60. 328-336, 
1996), in nuclear laminin degradation in apoptosis (McConkey etal. J.Biol. Chem.. 271. 22398- 
22406, 1996), in prostaglandin-E2 induced release of soluble TNF receptor shedding (Choi et 
al. Cellular Immunology 170, 178-184, 1996), in PAF synthesis (BussoHno et al. Eur. J. 
Immunol. 24. 3131-3139, 1994), and in the proteolysis of IkB, a regulatory molecule important 

15 in signal transduction and jq)optosis. Release of serine proteinases known as granzymes is central 
to CTL killing and many of the substrates cleaved by granzymes are also cleaved by cellular 
proteinases (for exan5)le, IHp is a substrate for Q^mzyme B as well as the cysteine proteinase, 
interleukin ip-converting enzyme (ICE)). Granzyme A, a serine proteinase with Arg-amidolytic 
activity, has been reported to induce the production of IL-6 and IL-8 in lung fibroblasts (Sower 

20 etal. Cellular Immunology 171. 159-163, 1996) and cleaves IL-lB to a 17kD mature form that 
is biologically active. 

Kinases are a large group of molecules, many of which regulate the response of cells to external 
stimuB. These molecules leguiate proliferation and differentiation in eukaryotic cells frequently 
25 via signal transduction pathways. 

The klentification of new serine proteinases and kinases permits the development of a range of 
derivatives, agonists and antagonists at the nucleic acid and protein levels which in turn have 
applications in the treatment and diagnosis of a range of conditions such as cancer, inflammation, 
30 neurological disorders amongst many other conditions including conditions which initiate or 
promote e^ptosis such as viral infection, old age and dmg abuse. One particularly useful serine 
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proteinase HELA2 (testisin) identified in accordance with the present invention is involved in 
spermatogenesis, testicular cancer and as a marker for cancer. ; i 

SUMMARY OF THE INVENTION 

5 

Sequence Identity Numbers (SEQ ID NOs.) for the nucleotide and amino acid sequences referred 
to in the specification are defined at the end of the subject specification. 

Throughout this specification and the claims which foOow, unless the context requires otherwise, 
10 the word "comprise", or variations such as "comprises" or "comprising", will be understood to 
imply the inclusion of a stated integer or group of integers but not the exclusion of any other 
integer or group of integers. 

One aspect of the present invention provides a novel molecule in isolated form involved in or 
15 associated with regulation of cell activity and/or viability. 

Another aspect of the present invention contemplates an isolated proteinaceous molecule 
involved in or associated with regulation of cell activity and/or viability comprising a sequence 
of amino acids encoded by a nucleotide sequence, at least a portion of which, is capable of being 
20 amplified by polymerase chain reaction (PGR) using the following primers: 

5' ACAGAATTCTGGGTIGTIACIGCIGCICAYTG3' [SEQ ID N0:1]; and 

5' ACAGAATTCAXIGGICCICCIC/GT/AXTCICC3' [SEQIDN0:2]; 

25 

or a complementary form of said primers. 

The proteinaceous molecule of the present invention may be a serine proteinase or a kinase. 

30 Yet another aspect of the present invention is directed to an isolated serine proteinase comprising 
the amino acid sequence substantially set forth in SEQ ID N0:4 or an amino acid sequence 
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having at least about 50% similarity to all or part thereof. This serine proteinase is referred to 
herein as a short isoform (S) of "HELA2" or "testisin". 

Still another aspect of the present invention relates to an isolated serine proteinase comprismg 
5 the amino acid sequence substantiaUy as set forth in SEQ ID N0:6 or an amino acid sequence 
having at least about 50% similarity to aU or part thereof. This serine proteinase is referred to 
herein as a long isofcMm (L) of HELA2 (testisin). 

StiU yet another aspect of the present invention provides an isolated serine proteinase comprising 
10 an amino acid sequence substantially as set forth in SEQ ID N0:8 or an amino acid sequence 
having at least about 50% similarity to aU or part thereof. This serine proteinase is referred to 
herein as "ATC2". 

Even yet another aspect of the present invention is directed to a serine proteinase in isolated form 
15 comprising a sequence of amino acids encoded by a nucleotide sequence substantially as set forth 
in SEQ ID N0:3 or a nucleotide sequence having at least 50% similarity to all or part thereof or 
a nucleotide sequence capable of hybridismg to the sequence set forth in SEQ ID N0:3 under 
low stringency conditions at 42*'C. 

20 AnoUier aspect of the present invention relates to a serine proteinase in isolated form comprising 
a sequence of amino acids encoded by a nucleotide sequence substantially as set forth in SEQ 
ID N0:5 or a nucleotide sequence having at least 50% similarity to aU or part thereof or a 
nucleotide sequence capable of hybridising to the sequence set fortii in SEQ ID N0:5 under low 
stringency conditions at 42"C. 

25 

Still anotiier aspect of the present invention provides a serine proteinase in isolated form 
comprising a sequence of amino acids encoded by a nucleotide sequence substantially as set forth 
in SEQ ID N0:7 or a nucleotide sequence having at least 50% similarity to aU or part thereof or 
a nucleotide sequence capable of hybridising to tiie sequence set forth in SEQ ID NO:7 under 
30 low stringency conditions at 42°C. 
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Another embodiment of the present invention is directed to a kinase in isolated form comprising 
an amino acid sequence substantially as set forth in SEQ ID NO: 10 or having 50% amino acid 
similarity to all or part thereof. This kinase is referred to herein as "BC0N3". 

5 In a related embodiment, the kinase comprises an amino acid sequence encoded by a nucleotide 
sequence substantially as set forth in SEQ ID N0:9 or a nuckotide sequence having at least 50% 
similarity to all or part of the nucleotide sequence set forth in SEQ ID NO:9 or a nucleotide 
sequence capable of hybridising to the nucleotide sequence set forth in SEQ ID N0:9 under low 
stringency conditions at 42'*C. 

10 

The present invention ftirther provides an isolated nucleic acid molecule encoding a polypeptide 
wherein at least a portion of said nucleic acid molecule is capable of being amplified by 
polymerase chain reaction (PGR) using the following primers: 

15 5' ACAGAATTCTGGGTIGTIACIGCIGCICAYTG3' [SEQ ID NO:l]; and 

5'ACAGAATTCAXIGGICCICCIC/GT/AXTCICC3' [SEQ IDN0:2]; 

or a complementary form of said primers. 

20 

The present invention also provides an isolated nucleic acid molecule comprising a sequence of 
nucleotides substantially as set forth in SEQ ID N0:3 or having 50% similarity to aU or part 
thereof or a nucleic acid molecule capable of hybridising to SEQ ID NO:3 under low stringency 
conditions at 42*'C. 

25 

Another aspect of the present invention provides an isolated nucleic acid molecule comprising 
a sequence of nucleotides substantially as set forth in SEQ ID N0:5 or having 50% similarity to 
all or part thereof or a nucleic acid molecule capable of hybridising to SEQ ID N0:5 under low 
stringency conditions at 42°C. 

30 



Still another aspect of the present invention is directed to an isolated nucleic acid molecule 
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conprising a sequence of nucleotides substantially as set forth in SEQ ID NO:7 or having 50% 
similarity to all or part thereof or a nucleic acid molecule capable of hybridising to SEQ ID N0:7 
under low stringency conditions at 42^C. 

5 Even still another aspect of the present invention provides an isolated nucleic acid molecule 
con5)rising a sequence of nucleotides substantially as set forth in SEQ ID NO:9 or having 50% 
similarity to all or part thereof or a nucleic acid molecule capable of hybridising to SEQ ID N0:9 
under low stringency conditions at 42°C. 

10 Another aspect of the present invention provides an isolated serine proteinase encoded by a gene 
proximal to a cluster of genes on a mammalian chromosome. 

More particularly, this aspect of the present invention is directed to a serine proteinase encoded 
by a gene proximal to a cluster of genes or human chromosome I6pl3.3 or its equivalent in a 
15 non-human species. 

Still more particularly, the serine proteinase is encoded by a gene comprising a nucleotide 
sequence substantially as set forth in SEQ ID NO: 3 or 5 or 28 or 29 or 30 or a nucleotide 
sequence having at least 50% similarity to any one thereof or a nucleotide sequence capable of 
20 hybridizing to any one of SEQ ID NO:3 or 5 or 28 or 29 or 30 under low stringency conditions 
at AT'C or a nucleotide sequence encoding a serine proteinase having an amino acid sequence 
substantially as set forth in SEQ ID N0:4 or 6 or an amino acid sequence having at least about 
50% similarity to SEQ ID NO:4 or 6. 



25 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a representation showing (A) schematic and (B) hydrophobicity plot of the HELA2 
aniino acid sequence. 

5 

Figure 2 is a diagrammatic representation showing: (A) the amino acid sequence of HELA2 
(testisin). The putative signal sequence, light chain, heavy chain and transmembrane domains are 
as indicated, the catalytic amino acids. His, Asp and Ser are as designated; insertion of Tyr-Ser 
(YS) 4 amino acids after the catalytic His is found in the long isoform of testisin and is due to 
10 alternative mRNA splicing; (B) in vitro transcription/translation of HELA2 (testisin) showing 
the protein product. 

Figure 3 is a diagrammatic representation of plasmid constructs pBluescriptHELA2(S) and 
pBluescriptEffiLA2(L) containing full length cDNAs for testisin (short isoform (S)) and testisin 
15 (long isoform (L)), respectively. 

Figure 4 is a diagrammatic representation of plasmid constructs pQET(20-295)N and pQET(20- 
295)C, wherein the hydrophobic residues of testisin were removed and the remaining sequences 
cloned into pQE prokaryotic expression plasmids; plasmids pGEX-l (90-279) comprising a 
20 carboxy terminal part of testisin fused to glutathione-S-transferase. 

Figure 5 is a photographic representation of: (A) silver stained gel showing purification of 
recombinant HELA2 (testisin) from E. coll The purified HELA2 (testisin) is indicated by the 
arrow in the eluate fractions. Some HELA2 (testisin) is also found in the wash fractions as the 

25 affinity matrix was not used in excess. His-N21 is one clone containing the amino-terminal His 
tag, and clones His-C21, His-C22 and His-C23 are three different clones with the carboxy- 
terminal His tag. (B) Western blot of native and denatured recombinant HELA2 (testisin) 
probed with Clontech anti-His tag-antibody. The 32kD band shown by the arrow is HELA2 
(testisin). HELA2 (testisin) is not detected in the denatured samples as it appears that 

30 denaturation with urea destroys the His epitope recognised by the monoclonal antibody. 
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Figure 6 is a representatfen of the amino acid sequence of HELA2 (testisin) showing the regions 
of the molecule selected for generation of peptide antigens. 

Figure 7 is a photographic representation of a Western blot of GST-HELA2 (testisin) fusion 
5 protein purified by affinity chromatography. 

Figure 8 is a diagrammatic representation of eukaryotic expression constructs, pcDNA3-Test(S- 
C), pcDNA3-Test(L-C) and pcDNA3-Test(l-297)L-C. 

10 Figure 9 is a diagranmiatic representation showing a histogram of the signal intensity from a 
Clontech Master RNA blot of the tissue distribution of PIELA2 (testisin) in RNA from 50 
different normal tissues. (A) Probed with HELA2 (testisin) specific probe; (B) Probed with 
BC0N3 specific prove which is ubiquitously expressed. The 8 tissues on the right hand side of 
the diagram are the control (negative) samples. 

15 

Figure 10 is a photographic representation of a multiple normal tissue Northern blot (Clonetcch) 
probed with: (A) HELA2 (testisin) specific probe and (b) BCON3 specific probe. 

Figure 11 is a photographic representation of agarose gel of PGR products generated by 
20 anq)lification of HELA2 (testisin) cDNA in prevasectomised and post-vasectomised ejaculate 
specimens. The HELA2 (testisin) PGR product is 464bp and the P2-macroglobulin product is 
250 bp. 

Figure 12 is a photographic representation of in situ hybridization of rat testis showing the 
25 localisation of HELA2 (testisin) mRNA to the germ cells of the testis. 

Figure 13 is a representation showing: (A) spread of normal metaphase chromosomes showing 
bright dots where HELA2 (testisin) is expressed at 16pl3.3; (B) Diagrammatic representation 
of chromosome 16pl3.3 showing location of HELA (testisin) and relationship to other disease 
30 causing genes. 
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Figure 14 is: (A) a photographic representation of northern blot analysis of HELA2 (testisin) 
mRNA showing signals in normal testis of 4 patients and absence of signal in the tumours of 
these patients; (B) a photographic representation of the locaHsation of HELA2 (testisin) protein 
in a human germ cell tumour section assessed by immunohistochemical staining using anti- 
5 HELA2 (testisin) peptide antibodies. Staining is only detected in the normal (N) tissue and not 
present in the tumour (T) tissue. 

Figure 15 is a diagrammatic representation of the genomic map of HELA2 (testisin) showing 
experimental^ determined intron/exon boundaries and relative sizes of the introns (marked with 
10 a letter) and exons (marked with a roman numeral). 

Figure 16 is a representation of HELA2 (testisin). Nucleotides in introns are in lowercase and 
exons in uppercase. The putative transcription start site is marked by +1. 

15 Figure 17 is a representation of the DNA sequence of Intron C and flanking exons showing 
where alternative mRNA splicing occurs to generate the two isoforms of HELA2 (testisin). 

F^re 18 is a representation of: (A) the cDNA sequence of the mouse homologue of HELA2 
(testisin). Catalytic residues are indicated by circles and cysteines likely involved hi disulfide 
20 bonding are indicated by squares; (B) Hydrophobicity plot of HELA2 (testisin) amino acid 
sequence. 

Figure 19 is a diagrammatic representation of chromosome 16pl3.3 showing the serine 
proteinase gene cluster which includes HELA2 (testisin). Lines represent cosmids containing 
25 the respective serine proteinase genes. 

Figure 20A is a representation of: (A) the cDNA sequence of SPOOILA (SEQ ID NO:28). 
Catalytic residues are indicated by circles and cysteins likely involved in disulfide bonding are 
indicated by squares; (B) hydrophobicity plots of SPOOILA amino acid sequence. 

30 

Figure 20B is a representation of: (A) the cDNA sequence of SP002LA (SEQ ID NO:29). 
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Catalytic residues are indicated by circles and cysteines likely involved in disulfide bonding are 
indicated by squares. (B) Hydrophobicity plot of SP002LA amino acid sequence. 

Figure 20C is a representation of: (A) the cDNA sequence of SP003LA (SEQ ID NO:30). 
5 Catalytic residues are indicated by circles and cysteines likely involved in disulfide bonding are 
indicated by squares. (B) Hydrophobicity plot of SP003LA amino acid sequence. 

Figure 21 is a photographic representation of in vitro transcription/translation of BCON3 
showing the protein products. 



- wo 98/36054 PCT/AU98/00085 



- 11 - 

A summary of the SEQ ID NOs used throughout the specification is presented in Table 1, 

TABLE 1 



SEQ ID NO 


DESCRIPTION 


1 


* PCR primer sequence 


2 


* PCR primer sequence 


3 


Nucleotide sequence of short form of HELA2 


A 


Aminrk ^TiiipniV! nf ^hnrt fnirn of HRT^A*? 


5 


Nucleotide sequence of long form of HELA2 


6 


Amino acid sequence of long f orai of HEL A2 


7 


Nucleotide acid sequence of ATC2 


8 


Amino acid sequence of ATC2 


9 


Nucleotide acid sequence of BCOM3 


10 


Amino acid sequence of BCOM3 


11 


Primers used to generate amino terminal 




tagged protein 


12 


Primers used to generate amino terminal 




tagged protein 




Primprs iKpd fn p^pnftrated carhoxv-linlced 




terminal protein 


14 


Primers used to generated carboxy-linked 




terminal protein 


15 


Peptide antigen T20-33 


16 


Peptide antigen T46-63 


17 


Peptide antigen T 175- 190 


18 


Forward primer 


19 


Reverse primer 


20 


Forward primer 


21 


Reverse primer 
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TABLE 1 (Continued) 



22 


Forward primer 


23 


Reverse primer 


24 


Serine proteinase activation motif 


25&26 


Mouse HEL A2 cDNA sequence 


27 


Human genomic DNA sequence 


28 


Clustered serine proteinase gene SPOOILA 


29 


Clustered serine proteinase gene SP002LA 


30 


Clustered serine proteinase gene SP003LA 



X=AorG 
Y=CorT 
Is Inosine. 



* Abbreviations: 

15 
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A list of single and three letter abbreviations for amino acid residues is presented in Table 2. 

TABLE 2 



5 Amino Acid Thiee-letter One-letter 

Abbreviation Symbol 



Alanine 


Ala 


A 


Arginine 


Arg 


R 


Asparagine 


Asn 


N 


Aspartic acid 


Asp 


D 


Cysteine 


Cys 


C 


Glutamine 


Gin 


Q 


Glutamic acid 


Glu 


E 


Glycine 


Gly 


G 


tCstidine 


His 


H 


Isoleucine 


De 


I 


Leucine 


Leu 


L 


Lysine 


Lys 


K 


Methionine 


Met 


M 


Phenylalanine 


Phe 


F 


Proline 


Pro 


P 


Serine 


Ser ■ 


S 


Threonine 


Thr 


T 


Tryptophan 


Trp 


W 


Tyrosine 


Tyr 


Y 


Valine 


Val 


V 


Any residue 


Xaa 


X 



30 
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DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

The present invention is predicated in part on a genetic engineering approach to identify 
nucleotide sequences encoding serine proteinases or kinases. The genedc engineering approach 
5 is based on the use of degenerate primers corresponding to conserved regions of serine 
proteinases (amino acids flanking His- and Ser- residues) to amplify gene fragments spanning 
these regions for cDNA, using low stringency reverse transcriptase-polymerase chain reaction 
(RT-PCR). 

10 This technique has been successfully used, in accordance with the present invention, to identify 
serine proteinases and kinases useful in modulating cell activity and viability including modulating 
spermatogenesis, acting as tumour suppressors and acting as a marker for non-testicular cancers. 

Accordingly, one aspect of the present invention provides a novel molecule in isolated form 
15 involved in or associated with regulation of cell activity and/or viability. 

More particularly, the present invention contemplates a novel serine proteinase in isolated form 
comprising a sequence of amino acids encoded by a nucleotide sequence, at least a portion of 
which, is capable of being amplified by polymerase chain reaction (PGR) using the following 
20 primers: 

5' ACAGAATTCTGGGTIGTIACIGCIGCICAYTG3' [SEQ ID N0:1]; and 
5'ACAGAATTCAXIGGICCICCiaGT/AXTCICC3' [SEQ ID N0:2]; 

25 

or a complementary form of said primers* « 

Preferably, X is A or G, Y is C or T and I is inosine* 

30 In a particularly preferred embodiment, the isolated serine proteinase comprises the amino acid 
sequence substantially set forth in SEQ ID N0:4 or an amino acid sequence having at least about 
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50% similarity to all or part thereof. This serine proteinase is referred to herein as a short 
isoform of "HELA2" or "HELA2 (testisin)". The terms "HELA2" and "testisin" are used 
interchangedly throughout the subject specification to refer to the same molecule. 

5 In another preferred embodiment, the amino acid sequence of the serine proteinase is 
substantially as set forth in SEQ ID NO:6 or an amino acid sequence having at least about 50% 
similarity to all or part thereof. This serine proteinase is the long isoform of HELA2 or HELA2 
(testisin). 

10 Yet another preferred embodiment of the present invention provides an amino acid sequence 
substantially as set forth in SEQ ID NO:8 or an amino acid sequence having at least about 50% 
similarity to all or part thereof. This serine proteinase is referred to herein as "ATC2". 

Another aspect of the present invention relates to a serine proteinase in isolated form comprising 
15 a sequence of amino acids encoded by a nucleotide sequence substantially as set forth in SEQ 
ID NO: 3 or a nucleotide sequence having at least 50% similarity to all or part thereof or a 
nucleotide sequence capable of hybridising to the sequence set forth in SEQ ID NO:3 under low 
stringency conditions at 42"C. 

20 Still another aspect of the present invention is directed to a serine proteinase in isolated form 
conprising a sequence of amino acids encoded by a nucleotide sequence substantially as set forth 
in SEQ ID NO:5 or a nucleotide sequence having at least 50% similarity to all or part thereof or 
a nucleotide sequence capable of hybridising to the sequence set forth in SEQ ID N0:5 under 
low stringency conditions at 42*'C. 

25 

In another aspect of the present invention, there is provided a serine proteinase in isolated form 
comprising a sequence of amino acids encoded by a nucleotide sequence substantially as set forth 
in SEQ ID NO:7 or a nucleotide sequence having at least 50% similarity to all or part thereof or 
a nucleotide sequence capable of hybridismg to the sequence set forth in SEQ ID NO:7 under 
30 low stringency conditions at 42°C. 
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Another embodiment of the present invention is directed to a kinase in isolated form comprising 
an amino acid sequence substantially as set forth in SEQ ID NO: 10 or having 50% amino acid 
similarity to all or part thereof. This kinase is referred to herein as "BCON3". 

S In a related embodiment, the kinase comprises an anuno acid sequence encoded by a nucleotide 
sequence substantial^ as set forth m SEQ ID N0:9 or a nucleotide sequence having at least 50% 
similarity to all or part of the nucleotide sequence set forth in SEQ ID NO:9 or a nucleotide 
sequence capable of hybridising to the nucleotide sequence set forth in SEQ ID NO:9 under low 
stringency conditions at 42"C. 

10 

The present invention further provides an isolated nucleic acid molecule comprising a sequence 
of nucleotides encoding or complementary to a sequence encoding a novel molecule involved 
in or associated with regulation of cell activity and/or viability. Preferably, the nucleic acid 
molecule is capable of being amplified by PCR using the primers set forth in SEQ ID NO:l 
15 and/or SEQ ID NO:2. 

More particularly, the present invention further provides an isolated nucleic acid molecule 
comprising a sequence of nucleotides substantially as set forth in SEQ ID NO: 3 or having 50% 
similarity to all or part thereof or a nucleic acid molecule capable of hybridising to SEQ ID NO: 3 
20 under low stringency conditions at 42°C. 

Another aspect of the present invention provides an isolated nucleic acid molecule comprising 
a sequence of nucleotides substantially as set forth in SEQ ID NO:5 or having 50% similarity to 
all or part thereof or a nucleic acid molecule capable of hybridising to SEQ ID N0:5 under low 
25 stringency conditions at 42°C. 

Another aspect of the present invention is directed to an isolated nucleic acid molecule 
conprising a sequence of nucleotides substantially as set forth in SEQ ID NO:7 or having 50% 
similarity to all or part thereof or a nuclek; acid molecule capable of hybridising to SEQ ID N0:7 
30 under low stringency conditions at 42T 
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Still another aspect of the present invention provides an isolated nucleic acid rriolecule 
conprising a sequence of nucleotides substantially as set forth in SEQ ED N0:9 or haying 50% 
similarity to all or part thereof or a nucleic acid molecule capable of hybridising to SEQ ID NO:9 
under low stringency conditions at 42°C. 

5 

Reference herein to a low stringency includes low stringency at 42**C includes and encompasses 
from at least about 1% v/v to at least about 15% v/v formamide and from at least about IM to 
at least about 2M salt for hybridisation, and at least about IM to at least about 2M salt for 
washing conditions. Alternative stringency conditions may be applied where necessary, such as 

10 medium stringency, which includes and encompasses from at least about 16% v/v to at least 
about 30% v/v formamide and from at least about 0.5M to at least about 0.9M salt for 
hybridisation, and at least about 0.5M to at least about 0.9M salt for washing conditions, or high 
stringency, which includes and encompasses from at least about 31% v/v to at least about 50% 
v/v formamide and from at least about 0.0 IM to at least about 0.1 5M salt for hybridisation, and 

15 at least about O.OIM to at least about 0.15M salt for washing conditions. 

Reference herein to similarity to "part" of a sequence means similarity to at least about 4 
contiguous amino acids or at kast about 12 contiguous nucleotide bases an4 more preferably at 
least about 7 contiguous amino acids or at least about 21 contiguous nucleotide bases. 

20 

The term "similarity" includes exact identity between sequences or, where the sequence differs, 
different amino acids may be related to each other at the structural, functional, biochemical 
and/or conformational levels. 

25 The term "isolated" includes biological purification and biological separation and encompasses 
molecules having undergone at least one purification, concentration or separation step relative 
to its natural environment. For example, a preparation may comprise at least about 10%. 
preferably at least about 20%, more preferably at least about 30%, still more preferably at least 
about 50% or greater of the molecule relative to at least one other component in a composition 

30 as determined by activity, mass, amino acid content, nucleotide content or other convenient 
means. 
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Hereinafter, the molecutes of the present invention are referred to as a "proteinase/kinase". The 
term "proteinase/kinase" includes the serine proteinases HELA2 (testisin) and ATC2 and the 
kinase BC0N3. The proteinase/kinase of the present invention may be in isolated, naturally 
occurring form or recombinant or synthetic form or chemical analogues thereof. 

5 

The proteinase/kinase of the present invention is preferably of human origin but fix)m non-human 
origins are also encompassed by the present invention. Non-human animals contemplated by the 
present invention include primates, livestock animals (e.g. sheep, cows, pigs, goats, horses, 
donkeys), laboratory test animals (e.g. mice, rats, guinea pigs, hamsters, rabbits), domestic 
10 con:5)anion animals (e.g. dogs, cats), birds (e.g. chickens, geese, ducks and other poultry birds, 
game birds, emus, ostriches) and captive wild or tamed animals (e.g. foxes, kangaroos, dingoes). 
The present invention also encompasses a proteinase/kinase homologue from Xenopus and 
plants. 

15 The nucleic acid molecules encoding a proteinase/kinase may be genomic DNA, cDNA or RNA 
such as mRNA. 

Yet another aspect of the present invention provides an isolated serine proteinase encoded by a 
gene proximal to a cluster of genes on a mamnialian chromosome. The cluster of genes is 

20 preferably on human chromosome 16pl3.3 or its equivalent in a non-human species. The cluster 
is made up of genes all encoding or having the potential to encode a serine proteinase or 
homologue, derivative or functional or evolutionary equivalent thereof. Preferably, the gene 
cluster comprises two or more of genes comprising a nucleotide sequence selected from SEQ 
ID N0:3 and 5 (HELA2, short and long forms, respectively) and SEQ ID NO:28 (SPOOILA), 

25 SEQ ID NO:29 (SP002LA), SEQ ID NO:30 (SP003LA) and SP004LA (see Figure 19) or a 
nucleotide sequence having at least 50% similarity to any one of those sequences or capable of 
hybridizing to any one of those sequences under low stringency conditions at 42'*C. 



30 



The term "proximal" is used in its broadest sense to mean a gene cluster and includes a gene 
within proximity to another gene. 
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Another aspect of the present invention contemplates a method for cloning a nucleotide sequence 
encoding a novel serine proteinase, said method comprising screening a nucleic acid library with 
said one or more or oligonucleotides defined by SEQ ID N0:1 and/or SEQ ID NO:2 and 
obtaining a clone therefrom which encodes said novel serine proteinase or part thereof. 

5 

Preferably, the nucleic acid library is genomic DNA, cDNA, genomic or mRNA library. 

Preferably, the nucleic acid library is a cDNA expression library. 

10 Preferably, the nucleic acid library is of human origin such as from brain, liver, kidney, neo-natal 
tissue, embryonic tissue, tumour or cancer tissue. 

With respect to HELA2 (HELA2 (testisin)), significant expression is generally only found in 
normal testis. Accordingly, the present invention extends to nucleic acid molecules capable of 
15 tissue-specific or substantially tissue-specific expression. 

Still another embodiment contemplates the promoter or a functional part thereof of the genomic 
gene encoding the subject proteinase/kinase of the present invention. The promoter may readily 
be obtained by, for example, "chromosome walking". A particularly useful promoter is from 
20 HELA2 (testisin) which can be regarded as a testis specific promoter. This promoter can be 
used, for example, to direct testis specific expression of genetic sequences operably linked to the 
promoter and may be used inter alia gene therapy or modulation of fertility. 

The present invention further contemplates a range of derivatives of the subject 
25 proteinase/kinase. Derivatives include fi-agments, parts, portions, mutants, homologues and 
analogues of the subject polypeptides and corresponding genetic sequences. Derivatives also 
include smgle or multiple amino acid substitutions, deletions and/or additions to the subject 
molecules or single or multiple nucleotide substitutions, deletions and/or additions to the genetic 
sequence encoding the molecules. "Additions" to amino acid sequences or nucleotide sequences 
30 include fiisions with other peptides, polypeptides or proteins or fusions to nucleotide sequences. 
Reference herein to the serine proteinase and kinase includes reference to all derivatives thereof 
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including functional derivatives or inununologically interactive derivatives. 

Analogues of the subject serine proteinase and kinase contemplated herein include, but are not 
limited to, modification lo side chains, incorporating of unnatural amino acids and/or their 
5 derivatives during peptide, polypeptide or protein synthesis and the use of crpsslinkers and other 
methods which impose conformational constraints on the proteinaceous molecule or their 
analogues. 

Examples of side chain modifications contemplated by the present invention include 
10 modifications of amino groups such as by reductive alkylation by reaction widi an aldehyde 
followed by reduction with NaBH4; amidination with methyiacetimidate; acylation with acetic 
anhydride; carbamoylation of amino groups with cyanate; trinitrobenzylation of amino groups 
with 2, 4, 6-trinitrobenzene sulphonic acid (TNBS); acylation of amino groups with succinic, 
anhydride and tetrahydrophthalic anhydride; and pyridoxylation of lysine with pyridoxal-5- 
15 phosphate followed by reduction witti NaBH4. 

The guanidine group of arginine residues may be modified by the formation of heterocyclic 
condensation products with reagents such as 2,3-butanedione, phenylglyoxal and glyoxal. 

20 The carboxyl group may be modified by carbodiimide activation via O-acylisourea formation 
followed by subsequent derivitisation, for example, to a corresponding amide. 

Sulphydryl groups may be modified by methods such as carboxymethylation with iodoacetic acid 
or iodoacetaraide; performic acid oxidation to cysteic acid; formation of a mixed disuiphides 
25 with other thiol compounds; reaction with maleimide, maleic anhydride or other substituted 
maleimide; formation of mercurial derivatives using 4-chloromercuribenzoate, 4- 
chloromercuriphenylsulphonic acid, phenylmercury chloride. 2-chloromercuri-4-nitrophenol and 
other mercurials; carbamoylation with cyanate at alkaline pH. 

30 Tryptophan residues may be modified by, for example, oxidation with N-bromosuccinimide or 
alkylation of the indole ring with 2-hydroxy-5-mtrobenzyl bromide or sulphenyl halides. 
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Tyrosine residues on the other hand, may be altered by nitration with tetranitromethane to form 
a 3-nitrotyrosine derivative. 

Modification of the imidazole ring of a histidine residue may be accomplished by alkyiation with 
5 iodoacetic acid derivatives or N-carbethoxylation with diethylpyrocarbonate. 

Examples of incorporating unnatural amino acids and derivatives during peptide synthesis 
include, but are not limited to, use of norleucine, 4-amino butyric acid, 4-amino-3-hydroxy"5- 
phenylpentanoic acid, 6-aminohexanoic acid, t-butylglycine, norvaline, phenylglycine, ornithine, 
10 sarcoshie, 4-amino-3-hydroxy-6-methylheptanoic acid, 2-thienyl alanine and/or D-isomers of 
amino acids. Alistof unnatural amino acid, contemplated herein is shown in Table 3. 
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TABLE3 



Non-conventional 


Code 


Non-conventional 


Code 


amino acid 




amino acid 




a-aminobutyric acid 


Abu 


L-N-methylalanine 


Nmala 


a-amino-a-methylbutyrate 


Mgabu 


L-N-methyiarginine 


Nmarg 


aniinocyclopropane- 


Cpro 


L-N-methylasparaginc 


Nmasn 


carboxylate 




L-N-methylaspartic acid 


Nmasp 


10 aminoisobutyric acid 


Aib 


L-N-methylcysteine 


Nmcys 


aminonorbomyl- 


Norb 


L-N-methylglutamine 


Nmgln 


carboxylate 




L-N-methylglutamic acid 


Nmglu 


cyciohexylalanine 




Chexa L-N-methylhistidine 


Nmhis 


cyclopentylalanine 


Cpen 


L-N-methylisolleucine 


Nmile 


15 D-alanine 


Dai 


L-N-methylleucine 


Nmleu 


D-arginine 


Darg 


L-N-methyllysine 


Nmlys 


D-aspartic acid 


Dasp 


L-N-methylmethionine 


Nmmet 


D-cysteine 


Dcys 


L-N-methylnorleucine 


Nmnle 


D-glutamine 


Dgln 


L-N-methylnorvaline 


Nmnva 


20 D-glutamic acid 


Dglu 


L-N-methylomithine 


Nmorn 


D-histidine 


Dhis 


L-N-methyiphenylalanine 


Nmphe 


D-isoleucine 


Dile 


L-N-methylproline 


Nmpro 


D-leucine 


Dleu 


L-N-methylserine 


Nniser 


D-lysine 


Dlys 


L-N-mcthiylthreonine 


Nmthr 


25 D-methionine 


Dmet 


L-N-methyltryptophan 


Nmtrp 


D-omithine 


Dorn 


L-N-methyltyrosine 


Nmtyr 


D-phenylalanine 


Dphe 


L-N-methylvaline 


Nmval 


D-proline 


Dpro 


L-N-methylethylglycine 


Nmetg 


D-serine 


Dser 


L-N-methyl-t-butylglycine 


Nmtbug 


30 D-threonine 


Dthr 


L-norleucine 


NIe 


D-tryptophan 


Dtrp 


L-norvaline 


Nva 
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D-tyrosine 


Dtyr 


D-valine 


Dval 


D-a-methylalanine 


Dmala 


D-a-methylargininc 


Dmarg 


5 D-a-mcthylasparagine 


Dmasn 


D-a-methylaspartate 


Dmasp 


D-a-methylcysteine 


Dmcys 


D-a-methylglutamine 


Dmgln 


D-a-methylhistidine 


Dmhis 


10 D-a-methylisoleucine 


DmUe 


D-a-methylleucine 


Dmleu 


D-a-methyllysine 


Dmlys 


D-a-methylmethionine 


Dmmet 


D-a-methylomithine 


Dmom 


15 D-a-methylphenylalanine 


Dmpbe 


D-a-methylpioline 


Diiq)ro 


D-a-methylserine 


Dmser 


D-a-methylthreonine 


Dmthr 


D-a-methyltryptophan 


Dmtrp 


20 D-a-methyltyrosine 


Dmty 


D-a-methylvaline 


Dmval 


D-N-methylalanine 


Dnmala 


D-N-methylarginine 


Dnmarg 


D-N-methylasparagine 


Dnmasn 


25 D-N-methylaspartate 


Dnmasp 


D-N-methylcysteine 


Dnmcys 


D-N-methylglutamine 


Dnmgln 


D-N-methylglutamate 


Dnmglu 


D-N-methylhistidine 


Dnmhis 


30 D-N-methylisoleucine 


Dnmile 


D-N-methylleucine 


Dnmleu 
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tt-metnyi-aminoisoDuiyrdie 




a-methyl-Y~aminobutyrate 


iVlgoUU 


a-methylcyclonexyiaianine 




a-methylcylcopentylalanine 


lVxCpt7ll 


a-metnyl-ct-naptnyiaiaiiine 




a-methylpenicillamine 


lYipen 


N-(4-aminobutyl)glycine 


INglU 


N-(2-aminoethyl)glycine 


Naeg 


N-(3-aminopropyl)glycine 


Nom 


N-amino-a-methylbutyrate 


xNmaaDU 


a-napthy lal anine 




N-benzylglycine 


Nnhp 


JN -(z-caroamy leinyi ;giycme 


Nain 


N -(carbamylmeinyl ;giycme 


i.Nc«oll 


Pi -(z-carDOxyemyi;giycine 


Nahi 


IM-(cartK)xymemyi;giycme 




N-cycloDutyigiycme 


Mrhiit 


N-cycloheptylglycine 




N-cyciohexylglycine 




N-cyclodecylglycine 




rs -cyicooouecyigiy ciuc 




IN -cycioQCiyigiycme 


1^ V/V/wt 


iN-cyciopropyigiydiic 


Ncpro 


i\-cyciounaccyigiycinc 




N-(2,2-diphenylethyl)glycine 


Nbnm 


N-(3,3-diphenylpropyl)glycine 


Nbhe 


N-(3-guanidinopropyl)glycine 


Narg 


N-( 1 -hydroxyethyl)glycine 


Nthr 


N-(hydroxyethyI))glycine 


Nser 


N-(imidazolylethyl))glycine 


Nhis 


N-(3-indolylyethyl)glycine 


Nhtrp 
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ri-N-methvl 1 vsine 


Dnmlys 


N-methyl-y-aniinobutyrate 


Nmgabu 


N[-mf»l"h vl f* vrl nhf* v V 1 al ani ne 


NmchexE 


D-N-methylmethionine 


Dnnunet 


JL/**1> "iiicuiy lui 111111111^ 


Dntnom 


N-methylcyclopentylalanine 


Nmcpen 


IN -ineuiy igiy ^.'iiic 


Nala 


D-N-methylphenylalanine 


Dnmphe 


^ XI mAtViT/lciminnicAKiitvrJitP 
J lN~IIlClliyi<UlUllUlJ>\^L/uijf iat5> 


Nmaiti 


D-N-methylproline 


Dnmpro 


IN % 1 ■•uieiiiyipropyi^giy v.'iiic 


Nile 


D-N-methvlserine 


Dnmser 


KT-Z'O-Tn^t'hvlnrrtnvl Wl vrine 


Nleu 


D-N-methylthreonine 


Dnmthr 


•-ixiciiiy lU. y piu^iuiii 


TDnmtrD 


N-( 1 -methy iethyl)glycine 


Nval 


^-.m#»flit/lf\/fY^CfTiP 
-iiic my iiy 1 uaiiic 


T^nmtvr 


N-methyla-napthylalanine 


Nmanap 


1 n XI mofli\/1i/a1tn^ 

1 u JL/'iN -nicuiy 1 vdiiiic 


Ofimval 

1.^AA1AAV CIA 


N-methylpeniciilamine 


Nmpen 


'v/-siminrkHiitvrir' 5iPlH 
Y'allUiluuuiyLiw AL^iu 


Gabu 


N-fp-hvdroxvDhenvDfflvcine 


Nhtyr 


j^-r-Duiyi]giyduc 




N-f tliiomethvI)£ivcine 


Ncys 


1^-ciiiy igiy t'liic 


Ftp 


penicillamine 


Pen 


l^^IlUliluuiiCityiuXcuiiiiw' 


Hnhe 


L-cc-methylalanine 


Mala 


1 *> T — /v-m^*tVi\/ljir<yininf' 


AYJ.CU^ 


L-a-methylasparagine 


Masn 


j^~vt'*iii&ii.iy lodLicu. ualw 


Masp 


L-a-methyl-f-butylglycine 


Mtbug 


T -ft-methvlcv^teine 


Mcys 


Lrmethylethylglycine 


Metg 


T -rf-m£*thvlpliitamine 


Mffln 


L-ot-methylglutamate 


Mglu 


T -<£-fTi^^thv1hi<HHiTie 

Im^ Ui ilAWtlA YAAAlijLAVAAAAw 


Mhis 


L-a-methylhomophenyialanine 


Mhphe 


20 L-tt'-inetnynsoieucine 


jvuie 


iN~^z~nicuiyiuuuviiiyi^giyv»iiic 


Nmet 


L-a-methylleucine 


Mieu 


L-a-methyllysine 


Mlys 


L-a-methylmethionine 


Mmet 


L-a-methylnorleucine 


Mnle 


L-a-methylnorvaline 


Mnva 


L-a-methyloraithine 


Morn 


L-a-methylphenylalanine 


Mphe 


L-a-methylproline 


Mpro 


25 L-a-methylserine 


Mser 


L-a-methylthreonine 


Mthr 


L-a-methyltryptophan 


Mtrp 


L-a-methyltyrosine 


Mtyr 
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L-a-methylvaline 

N~(N-(2,2-diphenylethyl) 

carbamylmethyl)glycine 



Mval 



L-N-methylhomophenylalanine 
N-(N-(3 ,3 -diphenylpropy 1) 
carbamylmethyOglycine 



Nmhphe 



Nnbhm 



Nnbhe 



1 -carboxy- l-(2,2-diphenyl- Nmbc 
5 ethylainino)cyclopropane 



Crosslinkers can be used, for example, to stabilise 3D conforaiations, using homo-bifunctional 
crosslinkers such as the bifunctional imido esters having (CH2)n spacer groups with n=l to n=6, 

10 glutaraldehyde, N-hydroxysuccinimide esters and hetero-bifunctional reagents which usually 
contain an amino-reactive moiety such as N-hydroxysuccinimide and another group specific- 
reactive moiety such as maleimido or dithio moiety (SH) or carbodiimdde (COOH). In addition, 
peptides can be conformationally constrained by, for example, incorporation of Q and 1^- 
methylamino acids, introduction of double bonds between Q and Cp atoms of amino acids and 

15 the formation of cyclic peptides or analogues by introducing covalent bonds such as forming 
an amide bond between the N and C termini, between two side chains or between a side chain 
and the N or C terminus. 

These types of modifications may be inip>rtant to stabilise the proteinase/kinase if administered 
20 to an individual or for use as a diagnostic reagent. 

The present invention further conteiT5)lates chemical analogues of the proteinase/kinase capable 
of acting as antagonists or agonists of the native molecules or which can act as functional 
analogues of the native molecules. For example, an antagonist may be a proteinase inhibitor. 
25 Chemical analogues may not necessarily be derived from the subject enzymes but may share 
certain conformational similarities. Alternatively, chemical analogues may be specifically 
designed to mimic certain physiochemical properties of the serine proteinases or kinases. 
Chemical analogues may be chemically synthesised or may be detected following, for example, 
natural product screening. 



30 



The identification of the novel molecules of the present invention permits the generation of a 



\VO 98/36054 



PCT/AU98/000S5 



-26- 

range of therapeutic molecules capable of modulating expression of their native counterparts 
or modulating their activity. Modulators contemplated by the present invention includes 
agonists and antagonists of proteinase/kinase expression. Antagonists of proteinase/kinase 
expression include antisense molecules, ribozymes and co-suppression molecules. Agonists 
5 include molecules which increase promoter ability or interfere with negative regulatory 
mechanisms. Agonists of proteinase/kinase include molecules which overcome any negative 
regulatory mechanism. Antagonists of the proteinase/kinase include antibodies and inhibitor 
peptide fragments. 

10 Other derivatives contemplated by the present invention include a range of glycosylation 
variants from a completely unglycosylated molecule to a modified glycosylated molecule. 
Altered glycosylation patterns may result from expression of recombinant molecules in different 
host cells. 

15 Another embodiment of the present invention contemplates a method for modulating 
expression of proteinase/kinase in a human, said method comprising contacting the 
proteinaseAdnase gene encoding proteinase/kinase with an effective amount of a modulator of 
proteinase/kinase expression for a time and under conditions sufficient to up-regulate or down- 
rcgulate or otherwise modulate expression of proteinase/kinase. For example, a nucleic acid 

20 molecule encoding proteinase/kinase or a derivative thereof may be introduced into a cell 
conversely, proteinase/kinase antisense sequences such as oligonucleotides may be introduced. 

Another aspect of the present invention contemplates a method of modulating activity of 
proteinase/kinase in a human, said method comprising administering to said mammal a 
25 modulating effective amount of a molecule for a time and under conditions sufficient to increase 
or decrease proteinase/kinase activity. The molecule may be a proteinaceous molecule or a 
chemkral entity and may also be a derivative of proteinase/kinase or its receptor or a chemical 
analogue or truncation mutant of proteinase/kinase or its receptor. 

30 One particularly useful serine proteinase, HELA2 (testisin), is implicated in spermatogenesis 
and in testicular tumour development. It is proposed, in accordance with the present invention, 
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that HELA2 (testisin) is involved in fertility and infertility. 

Northern blot analysis of Poly A+ RNA from normal tissue specimens showed a unique tissue 
distribution for HELA2 (testisin) with significant expression only in the testis. No signals are 
5 detected in any other tissue, with the exception of a minor signal in salivaiy gland. By RT-PCR, 
HELA2 (testisin) is detected in the ejaculate of normal males but not in the ejaculate of 
vasectomised males indicating that it is of germ cell origin. Hybridization data in situ indicated 
that HELA2 (tesdsin) is produced by immature germ cells in the testis, located near the basal 
epithelium and, hence, is an important factor for normal sperm maturation; defective expression 

10 or mutations would contribute to primary male infertility. Further, it is from the prccurcors of 
spermatocytes that 95% of testicular germ cell tumours, such as seminomas, embryonal 
carcinomas and teratocarcinomas arise. In the normal testis, germ cells undergo meiosts to 
become spermatocytes, but in individuals at risk, the germ cells continue to proliferate giving 
rise to germ cell tumours. Although not wishing to limit the present invention to any one theory 

15 or mode of action, it is proposed, in accordance with present invention, that HELA2 (testisin) 
functions at this critical juncture - cell growth versus maturation. 

Familial forms of testicular cancer are rare, but linkage analysis of a large family with familial 
seminoma has demonstrated Unkage to chromosome 16p, within a region adjacent to the 

20 HPKDl (human polycystic kidney disease) gene at I6pl3.3. The HELA2 (testisin) gene 
localises to chromosome 16pl3.3 which is near the telomere of chromosome 16 and is 
associated witti high genetic instability. The HELA2 (testisin) gene is sandwiched between four 
genes wliich underiie other human genetic disorders; HPKDl and tuberous sclerosis (TSC2) 
on the one side, and familial mediterranean fever (MEF) and Rubenstein-Taybi syndrome 

25 (RSTS) on the other side. The question of whether HELA2 (testisin) may be a tumour 
suppressor for seminoma was determined by comparing HELA2 (testisin) mRNA expression 
in normal testes with corresponding germ cell tumours from patients with seminoma. HELA2 
(testisin) was not detectable in the tumours of these patients, but was present in the 
corresponding normal testis specimens, indicative of a tumour suppressor role of HELA2 

30 (testisin) in testicular germ cell cancers. 
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Although restricted in normal tissues to the testes. HELA2 (testisin) is expressed in tumours 
of the colon, pancreas, prostate and ovary. This indicates that HELA2 (testisin) contributed 
to tumourigenesis and, therefore, has an application as a marker and also as a therapeutic anti- 
tumour target in these types of cancers. 

5 

These data point to a potentially very significant role for HELA2 (testisin) in testicular germ 
ceU maturation (spermatogenesis) as weU as in the genesis of testicular germ cell tumours. In 
accordance with the present invention, it is proposed that expression of HELA2 (testisin) by 
immature germ ceUs may be essential for sperm cell development, such that loss of HELA2 
10 (testisin) expression leads to continued and uncontrolled proliferation of immature germ cells 
leading to subsequent tumourigenesis. Germ cells wherein HELA2 (testisin) is mutated or 
absent may thus be prone to malignant transformation because of an inability to progress along 
the differentiation pathway. 

15 HELA2 (testisin) is weU-positioned to anchor on the surface of the germ cell where it would 
participate in a range of proteolytic activities, including cell migration, differentiation and/or 
activation of growth factors, receptors, or cytokines as well as initiate additional proteolytic 
cascades. Although not intending to limit the present invention to any one theory or mode of 
action, it is proposed, in accordance with the present invention, that the proteolytic target of 

20 HELA2 (testisin) is a cytokine, receptor or growth factor essential for either germ cell 
proliferation or differentiation - ie. HELA2 (testisin) may either inactivate a factor important 
for proUferation, or activate a factor which promotes differentiation. Thus, HELA2 (testisin) 
may be critical in the regulation of specific cytokines, cytokine receptors or growth factors by 
means of post-translational proteolytic processing. That HELA2 (testisin) is not present in 

25 otiier normal tissues of the male urogenital ti^ct, such as tiie prostate and kidney, also argues 
for such a role specific to the testis. 

Diagnostic and therapeutic applications for HELA2 (testisin) have die potential to be wide- 
ranging both in the cancer and fettility/infertiUty markets. In tumours, other than the testis, it 
30 is desirable to block or inhibit HELA2 (testisin) activity. As HELA2 (testisin) is a member of 
ttfi serine proteinase femily. for which prototype crystal structures are known and the catalytic 
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mechanism reasonably weU characterised, the design of drugs that target HELA2 (testisin) 
proteolytic activity as an anti-tumour therapy should be relatively straightforward. As HELA2 
(testisin) is pi^cted to be anchored on the cell surface, there would not be difficulties 
associated wUh deUveiy of drugs to intracellular compartments. Further, it is very possible that 
5 some tumour-associated HELA2 (testisin) may be proteolytically cleaved from the surface of 
tumour cells, and the extracellular domain detectable in patient serum as a potential tumour 
associated marker. 

Testicular cancer is the commonest malignancy in men aged 20-44 years. Early diagnosis 
10 correlates which an improved chance of cure and in a reduction in the severity of treatment. 
If the cancer is not treated early, it becomes very aggressive. The incidence of testicular cancer 
is significant (9/100,000) and has been rising over the last 10 years. In testicular germ cell 
tumours, such as seminoma, deUvery of recombinant HELA2 (testisin) using gene therapy 
techniques could lead to arrest of tumour growth and potentiaUy aUow commencement of 
15 normal sperm ceD maturation and differentiation, thereby reducing the need for surgical removal 
of the testis (orchidectomy). This may be particularly effective for patients who have already 
had one testicle removed because of testicular cancer. The risk of contralateral testicular cancer 
is increased in these patients and tumour development could be arrested through early treatment 
with HELA2 (testisin) to arrest growth and assist maturation of germ cells. The finding of 
20 mutant forms of HELA2 (testisin) may also lead to new markers for seminoma. Unlike other 
testicular non-seminomacancei-s where a-fetoprqtein and P-HCG are frequently elevated and 
can be used as tumour maricers, the lack of an adequate maricer for seminoma creates difficulties 
with staging and patient follow-up. 

25 A demonsti^ed role for HELA2 (testisin) in sperm maturation and development would likely 
lead to improved diagnosis and new directed therapeutics for male primary infertiUty. Primary 
male infertility is responsible for conception problems in 5-10% of couples and the world 
market for a tiierapeutic in this area would be very substantial. DeUvery of recombinant 
HELA2 (testisin) could assist sperm maturation and potentially trigger normal sperm 

30 development in some of these cases. The identification of mutant forms of HELA2 (testisin) 
could aid in diagnosis of infertiUty. If HELA2 (testisin) does not prove to be a tumour 
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suppressor, but is important for sperm maturation, it could provide a new target for the 
development of a male contraceptive. If hormonal regulation of HELA2 (testisin) can be 
demonstrated, HELA2 (testisin) may prove effective for the treatment of conditions arising 
from dysfunctional hormal responses, such as cryptorchidism, which is associated with both 
5 infertility and seminoma development. 

Accordingly, the present invention contemplates a pharmaceutical composition comprising 
proteinase/kinase or a derivative thereof or a modulator of proteinase/kinase expression or 
proteinase/kinase activity and one or more pharmaceutically acceptable carriers and/or diluents. 
10 These components are referred to as the "active ingredients" and include, for example, HELA2 
(testisin). 

The pharmaceutical forms suitable for injectable use include sterile aqueous solutions (where 
water soluble) and sterile powders for the extemporaneous preparation of sterile injectable 

15 solutions. It must be stable under the conditions of manufacture and storage and must be 
preserved against the contaminating action of microorganisms such as bacteria and fiingi. The 
carrier can be a solvent or dispersion medium containing, for example, water, ethanol, polyol 
(for example, glycerol, propylene glycol and liquid polyethylene glycol, and the like), suitable 
mixtures thereof, and vegetable oils. The proper fluidity can be maintained, for example, by the 

20 use of a coating such as lecithin, by the maintenance of the required particle size in the case of 
dispersion and by the use of superfactants. The preventions of the action of microorganisms 
can be brought about by various antibacterial and antifungal agents, for example, parabens, 
chlorobutanol, phenol, sorbic acid, thimersal and the like. In many cases, it will be preferable 
to include isotonic agents, for example, sugars or sodium chloride. Prolonged absorption of 

25 the injectable compositions can be brought about by the use in the compositions of agents 
delaying absorption, for example, aluminum monostearate and gelatin. 

Sterile injectable solutions are prepared by incorporating the active compounds in the required 
amount in the appropriate solvent with various of the other ingredients enumerated above, as 
30 required, followed by filtered sterilization. Generally, dispersions are prepared by incorporating 
the various sterilized active ingredients into a sterile vehicle which contains the basic dispersion 
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medium and the required other ingredients from those enumerated above. In the case of sterile 
powders for the preparation of sterile injectable solutions, the preferred methods of preparation 
are vacuum drying and the freeze-drying technique which yield a powder of the active 
ingredient plus any additional desired ingredient from previously sterile-filtered solution thereof. 

5 

When the active ingredients are suitably protected they may be orally administered, for example, 
with an inert diluent or with an assimilable edible carrier, or it may be enclosed in hard or soft 
shell gelatin capsule, or it may be compressed into tablets, or it may be incorporated directly 
with the food of the diet. For oral therapeutic administration, the active compound may be 

10 incorporated withexcipients and used in the form of ingestible tablets, buccal tablets, troches, 
capsules, elixirs, suspensions, syrups, wafers, and the like. Such compositions and preparations 
should contain at least 1% by weight of active compound. The percentage of the compositions 
and preparations may, of course, be varied and may conveniently be between about 5 to about 
80% of the weight of the unit. The amount of active compound in such therapeutically useful 

13 compositions in such that a suitable dosage will be obtained. Preferred compositions or 
preparations according to the present invention are prepared so that an oral dosage unit form 
contains between about 0. 1 //g and 2000 mg of active compound. 

The tablets, troches, pills, capsules and the like may also contain the components as Usted 
20 hereafter. A binder such as gum, acacia, com starch or gelatin; excipients such as dicalcium 
phosphate; a disintegrating agent such as corn starch, potato starch, alginic acid and the like; 
a lubricant such as magnesium stearate; and a sweetening agent such a sucrose, lactose or 
saccharin may be added or a flavouring agent such as peppermint, oil of wintergreen. or cherry 
flavouring. When the dosage unit form is a capsule, it may contain, in addition to materials of 
25 the above type, a Uquid carrier. Various other materials may be present as coatings or to 
otherwise modiiy the physical form of the dosage unit. For instance, tablets, pills, or capsules 
may be coated with shellac, sugar or both. A syrup or elixir may contain the active compound, 
sucrose as a sweetening agent, methyl and propylparabens as preservatives, a dye and 
flavouring such as cherry or orange flavour. Of course, any material used in preparing any 
30 dosage unit form should be pharmaceutically pure and substantially non-toxic in the amounts 
employed. In addition, the active compound(s) may be incorporated into sustained-ielease 
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preparations and formulations. 

Pharmaceutically acceptable carriers and/or diluents include any and all solvents, dispersion 
media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents and 
5 the like. The use of such media and agents for pharmaceutical active substances is well known 
in the art. Except insofar as any conventional media or agent is incompatible with the active 
ingredient, use thereof in the therapeutic compositions is contemplated. Supplementary active 
ingredients can also be incorporated into the compositions. 

10 It is especially advantageous to formulate parenteral compositions in dosage unit form for ease 
of administration and uniformity of dosage. Parental compositions are generally suitable for 
administration by the intravenous, subcutaneous or intramuscular routes amongst other routes 
of administration. Dosage unit form as used herein refers to physically discrete units suited as 
unitary dosages for the mammalian subjects to be treated; each unit containing a predetermined 

15 quantity of active material calculated to produce the desired therapeutic effect in association 
with the requaed pharmaceutical carrier. The specification for the novel dosage unit forms of 
the invention are dictated by and direcfly dependent on (a) the unique characteristics of the 
active material and the particular therapeutic effect to be achieved, and (b) the limitations 
inherent in the art of conf>ounding such an active material for the treatment of disease in living 

20 subjects having a diseased condition in which bodily health is impaired as herein disclosed in 
detail. Other forms of administration include but are not Umited to intranasal, buccal, rectal, 
suppository, inhalation, intracerebral and intraperitoneal. 

The principal active ingredient is compounded for convenient and effective administration in 
25 effective amounts with a suitable pharmaceutically acceptable carrier in dosage unit form as 
hereinbefore disclosed. A unit dosage form can, for example, contain the principal active 
compound in amounts ranging from 0.5 |ig to about 2000 mg. Expressed in proportions, the 
active confound is generally present in from about 0.5 ^ig to about 2000 mg/ml of carrier. In 
the case of compositions containing supplenientary active ingredients, the dosages are 
30 determined by reference to the usual dose and manner of administration of the said ingredients. 
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The effective amounts include antiounts calculated or predicted to have the desired effect and 
range from at least about 0.01 ng/kg body weight to about 10,000 mg/kg body weight. 
Alternative amounts include 0. 1 ng/kg body weight to about 1000 ng/kg body weight. 

5 The pharmaceutical composition may also con?)rise genetic molecules such as a vector capable 
of transfecting target cells where the vector carries a nucleic acid molecule capable of 
modulating proteinase/kinase expression or proteinase/kinase activity. The vector may, for 
exaiiq)le, be a viral vector. This form of therapy is proposed to be particularly useful for gene 
replacement or enhancement therapy for HELA2 (testisin) especially for the modulation of 

10 fertility and/or treatment of testicular cancer. 

Still another aspect of the present invention is directed to antibodies to proteinase/kinase and 
its derivatives. Such antibodies may be monoclonal or polyclonal and may be selected from 
naturally occurrmg antibodies to proteinase/kinase or may be specifically raised to 

15 proteinase/kinase or derivatives thereof. In the case of the latter, proteinase/kinase or its 
derivatives may first need to be associated with a carrier molecule. The antibodies and/or 
recombinant proteinase/kinase or its derivatives of the present invention are particularly useful 
as therapeutic or diagnostic agents. For example, monitoring non-testicular cancer by 
measuring HELA2 (testisin) or screening for the presence of testicular cancer by an absence of 

20 HELA2 (testisin). 

Proteinase/kinase and its derivatives may also be used to screen for naturally occurring 
antibodies to proteinase/kinase. These may occur, for example in some autoimmune diseases. 
Alternatively, specific antibodies can be used to screen for proteinase/kinase. Techniques for 
25 such assays are well known in the art and mclude, for example, sandwich assays and ELIS A. 
Knowledge of proteinase/kinase levels may be important for diagnosis of certain cancers or a 
predisposition to cancers or for monitoring certain therapeutic protocols. 

Antibodies the proteinase/kinase of the present invention may be monoclonal or polyclonal. 
30 Alternatively, fragments of antibodies may be used such as Fab fragments. Furthermore, the 
present invention extends to recombinant and synthetic antibodies and to antibody hybrids. A 
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"synthetic antibody" is considered herein to include fragments and hybrids of antibodies. The 
antibodies of this aspect of the present invention are particularly useful for immunotherapy and 
may also be used as a diagnostic tool for assessing apoptosis or monitoring the program of a 
therapeutic regimen. 

5 

For example, specific antibodies can be used to screen for proteinase/kinase proteins. The latter 
would be important, for example, as a means for screening for levels of proteinase/kinase in a 
cell extract or other biological fluid or purifying proteinase/kinase made by recombinant means 
from culture supernatant fluid. Techniques for the assays contemplated herein are known in the 
10 art and include, for example, sandwich assays and ELISA. 

It is within the scope of this invention to include any second antibodies (monoclonal, polyclonal 
or fragments of antibodies or synthetic antibodies) directed to the first mentioned antibodies 
discussed above. Both the first and second antibodies may be used in detection assays or a first 
15 antibody may be used with a commercially available anti-immunoglobulin antibody. An 
antibody as contemplated herein includes any antibody specific to any region of 
proteinase/kinase. 

Both polyclonal and monoclonal antibodies are obtainable by immunization with the enzyme 
20 or protein and either type is utilizable for immunoassays. The methods of obtaining both types 
of sera are well known in the art. Polyclonal sera are less preferred but are relatively easily 
prepared by injection of a suitable laboratory animal with an effective amount of 
proteinase/kinase, or antigenic parts thereof, collecting serum from the animal, and isolating 
specific sera by any of the known immunoadsorbent techniques. Although antibodies produced 
25 by this method are utilizable in virtually any type of immunoassay, they ate generally less 
favoured because of the potential heterogeneity of the product. 

The use of monoclonal antibodies in an immunoassay is particularly preferred because of the 
ability to produce them in large quantities and the homogeneity of the product. The preparation 
30 of hybridoma cell lines for monoclonal antibody production derived by fusing an unmortal cell 
line and lynphocytes sensitized against the immunogenic preparation can be done by techniques 
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which are well known to those who are skilled in the art. 

Another aspect of the present invention contemplates a method for detecting proteinase/kinase 
in a biological sample from a subject said method comprising contacting said biological sample 
5 with an antibody specific for proteinase/kinase or its derivatives or horaologues for a time and 
under conditions sufficient for an antibody-proteinase/kinase complex to form, and then 
detecting said con^lex. 

The presence of proteinase/kinase may be accomplished in a number of ways such as by 
10 Western blotting and ELISA procedures. A wide range of immunoassay techniques are 
available as can be seen by reference to US Patent Nos. 4,016,043, 4, 424,279 and 4,018,653. 
These, of course, includes both single-site and two-site or "sandwich" assays of the non- 
competitive types, as well as in the traditional competitive binding assays. These assays also 
include direct binding of a labelled antibody to a target. 

15 

Sandwich assays are among the most useful and commonly used assays and are favoured for 
use in the present invention. A number ofvariationsofthe sandwich assay technique exist, and 
all are intended to be encompassed by the present invention. Brieny, in a typical forward assay, 
an unlabeUed antibody is immobilized on a solid substrate and the sample to be tested brought 

20 into contact with the bound molecule. After a suitable period of incubation, for a period of time 
sufficient to allow formation of an antibody-antigen complex, a second antibody specific to the 
antigen, labelled with a reporter molecule capable of producing a detectable signal is then added 
and incubated, allowing time sufficient for the formation of another complex of antibody- 
antigen-labeUed antibody. Any unreacted material is washed away, and the presence of the 

25 antigen is determined by observation of a signal produced by the reporter molecule. The results 
may either be qualitative, by simple observation of the visible signal, or may be quantitated by 
comparing with a control sample contMning known amounts of hapten. Variations on the 
forward assay include a simultaneous assay, in which both sample and labelled antibody are 
added simultaneously to the bound antibody. These techniques are well known to those skUled 

30 in the art, including any minor variations as will be readily apparent. In accordance with the 
present invention the sample is one which might contain proteinase/kinase including cell extract; 
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tissue biopsy or possibly semm, saliva, mucosal secretions, lymph, tissue fluid and respiratory 
fluid. The sample is, therefore, generally a biological sample comprising biological fluid but 
also extends to fermentation fluid and supernatant fluid such as from a cell culture. 

5 In the typical forward sandwich assay, a first antibody having specificity for the 
proteinase/kinase or antigenic parts thereof, is either covalently or passively bound to a soUd 
surface. The solid surface is typically glass or a polymer, the most commonly used polymers 
being cellulose, polyaciylamide, nylon, polystyrene, polyvinyl chloride or polyptDRrlene. The 
soUd supports may be in the form of tubes, beads, discs of microplates, or any other surface 

10 suitable for conducting an immunoassay. The binding processes arc well-known in the art and 
generally consist of cross-linking covalently binding or physically adsorbing, the polymea:- 
antibody complex is washed in preparation for the test sample. An aliquot of the sample to be 
tested is then added to the solid phase complex and incubated for a period of time sufficient 
(e.g. 2-40 minutes or overnight if more convenient) and under suitable conditions (e.g. from 

15 about room teiUKraturc to about 3TC) to allow binding of any subunit present in the antibody. 
Following the incubation period, the antibody subunit solid phase is washed and dried and 
incubated with a second antibody specific for a portion of the hapten. The second antibody is 
linked to a reporter molecule whkih is used to indicate the binding of the second antibody to the 
hapten. 

20 

An alternative method involves immobilizing the target molecules in the biological sample and 
then exposing the immobilized target to specific antibody which may or may not be labeUed 
with a reporter molecule. Depending on the amount of target and the strength of the reporter 
molecule signal, a bound target may be detectable by direct labelling with the antibody. 
25 Alternatively, a second labelled antibody, specific to the first antibody is exposed to the target- 
first antibody complex to form a target-first antibody-second antibody tertiary complex. The 
complex is detected by the signal emitted by the reporter molecule. 

By "reporter molecule" as used in the present specification, is meant a molecule which, by its 
30 chemical nature, provides an analytically identifiable signal which allows the detection of 
antigen-bound antibody. Detection may be either qualitative or quantitative. The most 
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commonly used reporter molecules in this type of assay are either enzymes, fluorophores or 
radionucleotide containing molecules (i.e. radioisotopes) and chemiluminescent molecules. 
In the case of an enzyme immunoassay, an enzyme is conjugated to the second antibody, 
generally by means of glutaraldehyde or periodate. As will be readily recognized, however, a 

5 wide variety of different conjugation techniques exist, which are readily available to the skilled 
artisan. Commonly used enzymes include horseradish peroxidase, glucose oxidase, beta- 
galactosidase and alkaline phosphatase, amongst others. The substrates to be used with the 
specific enzymes are generally chosen for the production, upon hydrolysis by the corresponding 
enzyme, of a detectable colour change. Examples of suitable enzymes include alkaline 

10 phosphatase and peroxidase. It is also possible to employ fluorogenic substrates, which yield 
a fluorescent product rather than the chromogenic substrates noted above. In all cases, the 
enzyme-labelled antibody is added to the first antibody hapten complex, allowed to bind, and 
then the excess reagent is washed away. A solution containing the appropriate substrate is then 
added to the con^lex of antibody-antigen-antibody. The substrate will react with the enzyme 

15 linked to the second antibody, giving a qualitative visual signal, which may be further 
quantitated, usually spectrophotonKtrically, to give an indication of the amount of hapten which 
was present in the sample. "Reporter molecule" also extends to use of cell agglutination or 
inhibition of agglutination such as red blood cells on latex beads, and the like. 

20 Alternately, fluorescent compounds, such as fluorescein and rhodanaine, may be chemically 
coupled to antibodies without altering their binding capacity. When activated by illumination 
with light of a particular wavelength, the fluorochrome-labelled antibody adsorbs the light 
energy, inducing a state to excitability in the molecule, followed by emission of the light at a 
characteristic colour visually detectable with a light microscope. As in the EIA, the fluorescent 

25 labelled antibody is allowed to bind to the first antibody-hapten complex. After washing off the 
unbound reagent, the remaining tertiary complex is then exposed to the light of the appropriate 
wavelength the fluorescence observed indicates the presence of the hapten of interest. 
Immunofluorescene and EIA techniques are both very well established in the art and are 
particularly prefen^ for the present method. However, other reporter molecules, such as 

30 radioisotope, chemiluminescent or bioluminescent molecules, may also be employed. 
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The present invention also contemplates genetic assays such as involving PCR analysis to detect 
proteinase/kinase gene or its derivatives. Alternative methods or methods used in conjunction 
include direct nucleotide sequencing or mutation scanning such as single stranded conformation 
polymorphoms analysis (SSCP), specific oligonucleotide hybridisation, and methods such as 
5 direct protein truncation tests. 

The nucleic acid molecules of the present invention may be DN A or RNA. When the nucleic 
acid molecule is in DNA form, it may be genomic DNA or cDNA. BINA forms of the nucleic 
acid molecules of the present invention are generally mRNA. 

10 

Although the nucleic acid molecules of the present invention are generally in isolated form, they 
may be integrated into or ligated to or otherwise fused or associated with other genetic 
molecules such as vector molecules and in particular expression vector molecules. Vectors and 
expression vectors are generally capable of replication and, if applicable, expression in one or 
15 both of a prokaryotic cell or a eukaryotic cell. Preferably, prokaryotic cells include E. coli. 
Bacillus sp and Pseudomonas sp. Preferred eukaryotic cells include yeast, fungal, mammalian 
and insect cells. 

Accordiiigly, another aspect of the present invention contemplates a genetic construct 
20 comprising a vector portion and a mammalian and more particularly a human proteinase/kinase 
gene portion, which proteinase/kinase gene portion is capable of encoding an proteinase/kinase 
polypeptide or a functional or immunologically interactive derivative thereof. 

Preferably, the proteinase/kinase gene portion of the genetic construct is operably Unked to a 
25 promoter on the vector such that said promoter is capable of directing expression of said 
proteinase/kinase gene portion in an appropriate cell. 

In addition, the proteinase/kinase gene portion of the genetic construct may comprise all or part 
of the gene fused to another genetic sequence such as a nucleotide sequence encoding 
30 glutathione-S-transferase or part thereof. 
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The present invention extends to such genetic constructs and to prokaryotic or eukaryotic cells 
comprising same. 

The present invention also extends to any or all derivatives of proteinase/kinase including 
5 mutants, part, fragments, portions, homologues and analogues or their encoding genetic 
sequence including single or multiple nucleotide or amino acid substitutions, additions and/or 
deletions to the naturally occurring nucleotide or amino acid sequence. The present invention 
farther encompasses hybrids between the proteinase/kinases such as to broaden the spectrum 
of activity and to ligands and substrates of the proteinase/kinase. 

10 

The proteinase/kinase and its genetic sequence of the present invention will be useful in the 
generation of a range of therapeutic and diagnostic reagents. 

Soluble proteinase/kinase polypeptides or other derivatives, agonists or antagonists are also 
15 contemplated to be useful in the treatment of disease, injuiy or abnormality in the nervous 
system, e.g. in relation to central or peripheral nervous system to treat Cerebral Palsy, trauma 
induced paralysis, vascular ischaemia associated with stroke, neuronal tumours, motoneurone 
disease, Parkinson's disease, Huntington's disease, Alzheimer's disease, Multiple Sclerosis, 
perq)heral neuropathies associated with diabetes, heavy metal or alcohol toxicity, renal failure 
20 and infisctious diseases such as herpes, rubella, measles, chicken pox, HIV or HTLV-L Other 
conditions for which the proteinase/kinase are useful include cancer, metastasis and autoimmune 
disease amongst many others. Particular applications for HELA2 (testisin) include as a marker 
for non-testicular cancers, in the treatment of testicular cancer and in the treatment of infertility 
or in inducing infertility such for contraception. 

25 

A further aspect of the present invention contemplates the use of proteinase/kinase or its 
fanctional derivatives in the manufacture of a medicament for the treatment of proteinase/kinase 
mediated conditions defective or deficient. 

30 The present invention is further described by the following non-limiting Examples. 
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EXAMPLE 1 
CLONING PROCEDURES 

In order to identify serine proteinases that may be involved in regulatory cellular functions, a 
5 genetic screening approach was applied using degenerate primers corresponding to conserved 
regions of serine proteinases (amino acids flanking His- and Ser- residues) to amplify gene 
fragments spanning these regions from cDNA, using a low stringency RT-PCR (Reverse 
Transcriptase-Polymerase Chain Reaction) approach. 

10 By this technique, the aim was to isolate low abundance genes as well as those present in 
moderate to high abundance. The cDNA used for these experiments was isolated from a HeLa 
cell cytotoxicity model wherein PAI-2 expression inhibits TNF(-induced apoptosis (Dickinson 
etal 1 Biol. Chem. 270: 27894-27904, 1995). These PAI-2 expressing cells provide a unique 
and viable system for investigating TNF(signalUng pathways as they are protected from the 

15 cytotoxic effects of TNF). 

cDNA was generated from RNA isolated from HeLa cells and PAI-2 expressmg HeLa cells, 
both untreated and following treatment with TNF and cycloheximide. Amplification of each 
cDNA population using PGR and the following serine proteinase degenerate primers. 

20 

His Primer: 5'ACAGAATTCTGGGTIGTIACJGCIGCICAYTG3' [SEQ ID NO:l], 
Ser Primer: 5'ACAGAATTCAXIGGICCICCIC/GT/AXTCICC3' [SEQ ID N0:2] 

(where X= A or G; Y= C or T; 1= Inosine) 

25 produced DNA fragments in the range of 480bp, the approximate predicted size of the serine 
proteinase intergenic region. These amplified DNA fr^ments were cloned into E. coli 
generating a library containing approximately 150 independent clones. The inventors analysed 
36 of these clones and found that 9 encoded previously identified serine proteinases or 
tissue-type or urokinase-type plasminogen activators, thereby demonstrating the efficacy of this 

30 approach. Of the other 36, two were found to encode novel open reading frames with high 
homology to serine proteinases and are referred to herein as "HELA2" (or "testisin") and 



' wo 98/36054 



PCT/AU98A)0085 



-41- 

"ATC2". One additional clone designated herein, "BCON3", showed homology to a kinase. 
Extension of the DNA fragments by RACE in both 5' and 3' directions using internally derived 
primers has verified the homology of HELA2 and ATC2 to the serine proteinase family. Each 
of the three DNA sequences are unique in that they are markedly different from any known 
5 DNA or protein sequence in the Genbank and NBRF databases. 

E3tAMPLE2 
HELA2 SERINE PROTEINASE (TESTISIN) 

10 The HELA2 mRNA transcript is approximately 1.5kb as determined from Northern blot 
analysis. Nucleic acid sequence was obtained for about Llkb of HELA2 which spans the entire 
coding region, the 3' noncoding region and part of the 5' noncoding region. The coding region 
starts with an ATG codon which is present in a motif analogous to the Kozak eukaryotic 
translation initiation consensus sequence. Alignment of the deduced amino acid sequence of 

15 HELA2 with homologous serine proteinases shows that the cDNA encodes a 3 14 amino acid 
(aa) polypeptide with a calculated molecular weight of 34.8kD (called Testisin), which is 
synthesized as a zynragen containing pre-, pro- and catalytic regions (Figure I). The pro- region 
(or light chain) and the catalytic region (heavy chain) are delineated by a classic serine 
proteinase activation motif Arg-De-Val-Gly-Gly [SEQ ID NO:24] with cleavage likely 

20 occurring between Arg and lie. The catalytic region includes the catalytic triad of His, Asp and 
Ser in positions and motifs which are highly conserved among the serine proteinases. Ten Cys 
residues occur in conserved positions: by analogy to other serine proteinases, eight of these 
function to form disulfide bridges within the catalytic region and the remaining two link the pro- 
and catalytic regions. 

25 

Structural features conserved in the binding pockets of serine proteinases are present in HELA2 
(testisin). An Asp residue at the bottom of the serine proteinase binding pocket sbt residues 
before the active site Ser in HELA2 (testisin) indicates that HELA2 (testisin) has trypsin-like 
specificity, with proteolytic cleavage after Arg or Lys in target substrates. HELA2 (testisin) also 
30 contains a conserved Ser-Trp-Gly motif at the top of the binding pocket which is likely involved 
in hydrogen bonding with target substrates in other serine proteinases. 
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A hydrophobicity plot of the HELA2 (testisin) amino acid sequence (Figure 1) identifies two 
hydrophobic regions, one located at the amino terminus and the other at the carboxy terminus. 
The 20 aa amino terminal hydrophobic region is likely to be a signal peptide, which would direct 
newly synthesized HELA2 to enter the endoplasmic reticulum. The 16 aa hydrophobic caiboxy 

5 terminus of HELA2 (testisin) shows high homology to the transmembrane domain of prostasin 
(Figure 2), suggesting that HELA2 (testisin) is likely to be a membrane-anchored serine 
proteinase. Thus HELA2 (testisin) may anchor on the germ cell surface where it could 
participate in a range of proteolytic activities, including participation in cell migration, 
differentiation and/or activation of growth factors and proteolytic cascades. In prostasin. this 

10 protruding carboxy terminus may be cleaved, thus releasing the serine proteinase from die 
membrane. A similar cleavage event may also occur with Testasin. 

Two isoforms of HELA2 were identified in a HeLa cell cDNA library (Stratagene UniZap HeLa 
Library) which differ by an insertion of 6 nucleotides which generates a Sfil restriction enzyme 
15 site. At the protein level, there is a corresponding insertion of 2 aa's (Tyr-Ser) withm the 
catalytic binding pocket (Figure 2A). The two isoforms of HELA2 cDNA are referred to as the 
short (S) and long (L) isoforms, respectively. The nucleotide and correspondmg amino acid 
sequence for the short isoform of HELA2 is diown in SEQ ID NOs. 3 and 4, respectively. The 
long isoform is shown in SEQ ID NO:5 and 6, respectively. 

20 

EXAMPLE 3 

GENERATION OF FULL LENGTH cDNA ENCODING HELA2 (TESTISIN) 

Partial cDNA fragments of the short and long isoforms of HELA2 were obtained using a 
25 combination of library screening techniques. Plasmids containing the full length cDNA of the 
two isoforms were then generated in pBluescriptSK(-) by ligating restriction enzyme-digested 
fragments of die partial cDNAs. A plasmid map of the two generated constructs. 
pBluescriptHELA2(S) and pBluescriptHELA2(L), and a restriction enzyme map of the long 
isoform cDNA arc diagrammed in Figure 3. 

30 

In vitro transcription/translation using HELA2 cDNA shows a major specific product of 
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approximately 35kD (Fig. 2B), which is the same as size predicted from the open reading frame, 
demonstrating that HELA2 cDNA encodes a protein. The translation/transcription coupled 
rabbit reticulocyte lysate system (Promega) was used as per the manufacturer's instructions for 
35S-methionine labeliing. Clones of HELA2 in pBluescript a PAI-2 positive control were used 
5 with T3-RNA polymerase (sense direction). 

EXAMPLE 4 

EXPRESSION OF RECOMBINANT HELA2 (TESTISIN) IN E.COLI 

10 (A) Generation of expression constructs 

(i) His(6)-tagged recombinant HELA2 (testisin) 
To reduce potential toxic elfects on host cells, and therefore optimise expression, a strategy was 
employed to eliminate the hydrophobic residues of the secretory and membrane anchoring 
domains of HELA2 (testisin) (Testisin (20-295)). Testisin (20-295) fragments which were His6 

15 tagged at either the anuno or carboxy tenninal were obtained by PGR and expression constructs 
were generated by inserting these into pQE vectors (Qiagen), 

The primers used to generate the amino-terminal tagged protein were: 
forward: 5* GCACAGTCGACCAAGCCGGAGTCGCAGAG 3* [SEQ ID NO: II] and 
20 reverse: 5' GCACA AAGCTTGCCAGGAGGGGTCTGGCTG 3' [SEQ ID NO: 12] 

The amplification product of 858bp was digested with Sail and Hindlll and ligated into pQE-10 
to give pQE-10(20-295)N (Figure 4). 

The primers used to generate the carboxy-terminal tagged protein were: 
25 forward: 5' GCACAACCATGGCCAAGCCGGAGTCGCAGGAG 3* [SEQ ID NO:131 and 
reverse 5' GCACAAGATCTCCAGGAGGGGTCTGGCTG 3' [SEQ ID NO: 14]. 
The amplification product of 859 bp was digested with Ncol and BgHI and ligated into PQE-60 
to give pQE-60(20-295)C (Figure 4). 

30 (ii) GST-tagged recombinant HELA2 (testisin) 

In order to generate a fusion of giutathione-S-transferase (GST) and HELA2 (testisin), 



wo 98/36054 



PCT/AUW00085 



-44- 

pBluescrq)tHELA2(S) was digested with Sau3Al. releasing a 570bp DNA fragment encoding 
the 190 amino acids at the carboxy terminal end of HELA2 (testisin). This DNA fragment was 
cloned into the BamHl site of pGEX-l generating pGEX- 1(90-279) (Figure 4) and subjected 
to DNA sequence analysis to confirm that the fusion was in frame, 

5 

(b) Expression of His-tagged HELA2 (testisin) in E. coli 

pQE10(20-295)N and pQE60(20-295)C plasmids were electro-transformed into E. coli DH5('- 
cells. Four different clones were selected for further analysis: His-N21 expressing amino 

10 terminal His6-tagged Testisin (20-295); and His-C21, His-C22, and His-C23 expressing 
carboxy terminal His6-tagged Testisin (20-295). To express recombinant HELA2 (testisin) 
protein, transformed cells were grown to log phase then induced for 4 hours in the presence of 
2mM IPTG. Cells were lysed in a denaturing lysis buffer containing 8M urea, 0. IM NaH2P04 
and 0,0 IM Tris/HCI pH8. Alternatively the cells were lysed in a non-denaturing lysis buffer 

15 containing 0. IM NaH2P04, 0. IM NaCl and 0.01 M Tris/HCI pH8. The His6 tagged protein 
was recovered by mixing the lysate with a metal affinity resin (Qiagen or Clontech). Purified 
testisin(L) was eluted with 100 mM EDTA in lysis buffer (pH 6.3). A major band of 
approximately 32 kDa was obtained in the eluate as shown by the arrows in Figure 5A. 
Western blot analysis of a purification of the His-C23 clone using an anti-His6 antibody showed 

20 that the band at 32 kDa was His6 tagged HELA2 (testisin) (Figure 5B). 

EXAMPLE 5 
IMMUNOLOGY 

25 (A) Rabbit Polyclonal Antibodies Directed Against HELA2 (testisin) Peptide Antigens 

Three peptides were selected from the HELA2 (testisin) amino acid sequence on the basis of 
predicted antigenicity, hydrophilicity and lack of identity with known proteins (Figure 6). 



30 Peptide antigen T20-33 KPESQEAAPLSGPC [SEQ ID NO: 15] 

Peptide antigen T46-63 EDAELGRWPWQGSLRLWDC [SEQ ID NO: 16] 
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Peptide antigen T175-190 GYIKEDEALPSPHTLQC [SEQ ID NO: 171 

These peptides were synthesized (Auspep) and coupled to keyhole limpet hemocyanin. The 
coupled peptide (500 Fg) in PBS (0.5 ml) was emulsified in an equal volume of Freund's 

5 complete adjuvant before injection into a rabbit. Booster injections of coupled peptide in 
Freund's incomplete adjuvant were made at intervals of 2 to 3 weeks. Each rabbit was bled 
(approximately 1 ml) before the initial injection and about 7 days after the second and 
subsequent boosters and the antibody titre assessed by direct EUSA assay. Immunoreactive 
antisera against the peptide antigens was demonstrated and when a sufficiently high titre was 

10 achieved (after 3 to 5 boosters), between 12 and 25 ml of blood was removed from each animal. 

Rabbit antisera was affinity purified against the respective immunising peptides by 
chromatography using peptide-coupled affinity columns. Immunoreactivily of the aflBnity 
purified antibodies against BELA2 (testisin) was demonstrated by Westem blot analysis of 

15 GST-tagged recombinant HELA2 (testisin). pGEX- 1(90-279) plasmid DNA (described in 
Exanq)le 4) was electro-transformed into E. coli DH5( cells and induced for 3 hours in the 
presence of 0.5mM IPTG. CeUs were lysed in 1.5% sarcosyl, 2% Triton XlOO and then 
sonicated. After removal of the insoluble fraction by centrifiigation, the cell lysate was mixed 
with a 50% slurry of Glutathione Sepharose 4B, washed, and the purified GST-Testisin(90-279) 

20 was eluted by boiling with SDS-Sample buffer. Figure 7 shows an example of Western blot 
analysis of the eluate using anti-Peptide T175-190 antibody demonstrating a purified, 
immunoreactive band representative of GST-linked HELA2(testisin) of approximately 47 kDa. 

(B) Rabbit Polyclonal Antibodies Directed Against Purified Bacterially Expressed HELA2 
25 (testisin) 

An SDS-PAGE gel slice containing purified His6 tagged HELA2 (testisin) (as described in 
Example 4, part (b)) is to be conibined with adjuvant and rabWts immunized as described above: 
Rabbit antisera are tested by Westem blot analysis for immunoreactivity against purified 
30 recombinant HELA2 (testisin) and HELA2 (testisin) in cell extracts, as well as use in 
immunohistochemical analyses. 
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EXAMPLE6 

EXPRESSION OF HELA2 (TESTISIN) IN EUKARYOTIC CELLS 
(A) Generation of expression constructs 

5 

Eukaryotic expression constructs encoding testisin(s) and testisin(L) His6 tagged at the carboxy 
terminal were generated in the eukaryotic expression vector pcDNA3 (Invitrogen). DNA 
fragments encoding HELA2 (testisin) were generated by PCR fiom both pBluescriptHELA2(S) 
and pBluescriptHELA2(L) using the primers: 
10 forward: 5' GCACAGGTACCGAGGCCATGGGCGCGCGC 3' [SEQ ID NO: 18] and 
reveree 5' gCACATCTAGATCAGTGGTGGTGGTGGTGGTGGACCGGCCCCAGGA 

GTGG 3' [SEQ ID NO: 19] 

The PGR product of 985 bp obtained from amplification of HELA2 (testisin) from 
15 pBluescriptHELA2(S) as template was ligated into pGEM-T (Easy) vector (Promega). 
Digestion of this shuttle construct with NotI released a 1025 bp fragment which was Ugated into 
pcDNA3 generating the short isoform expression construct pcDNA3-Test(S-C) (Figure 8). 
PGR amplification of the long isoform template gave a 991 bp product which was ligated into 
pGEM-T (easy) vector. NotI digestion of the shuttle construct released a 103 1 bp fragment 
20 which was ligated into pcDNA3 giving pcDNA3-Test(L-C) (Figure 8). 

Soluble testisin (l-295)-His6 in which the membrane anchoring sequence is deleted and the 
protein is caiboxy-His6 tagged is to be obtained by PGR ampliiication of HELA2 (testisin) from 
pBlucscriptHELA2(L) using the primers: 
25 forward: 5' GCACAGCGGCCGCGAGGCCATGGGCGCGCGC 3' [SEQ ID NO:20] and 
reverse: 5' GCACAGCGGCCGCTCAGTGGTGGTGGTGGTGGTGCCAGGAGGGGTC 
TGGCTG 3' [SEQ ID N0:21]. 

The PGR product will be digested with Nofl and ligated into pcDNA3 generating the long 
isoform expression construct pcDNA3-Tcst(l-295)L-C (Figure 8). 

30 

(B) Expression and cellular localisation of HELA2 (testisin) 
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Each of the expression constructs is transiently transfected into a eukaryotic cell line (eg. HeLa. 
CHO or COS cells) by electroporation. Expression is confirmed by Northern blot and 
immunoblot. The His6 tag is a small, uncharged tag which reportedly does not interfere with 
cellular membrane interactions and is able to be detected with anti-His6 antibodies. HELA2 

5 (testisin) cellular localisation is analysed by immunofluorescence using antibodies directed . 
against the His6 tag and stained cells examined by confocal microscopy. Mock transfected cells 
is monitored as one of the controls in these experiments. Cells are examined under 
non-permeablised and permeabilised conditions to investigate intraceUular and cell surface 
expression of HELA2 (testisin) tagged proteins. Possible release of HELA2 (testisin) into the 

10 supernatant is monitored by immunoblotting of conditioned media. Association of HELA2 
(testisin) witii a particular cellular compartment is confirmed by cellular fi-actionation smdies. 
Stable transfectants of the fiiU length and truncated tagged HELA2 (testisin) is generated by 
selection in G4I8. Recombinant HELA2 (testisin) is purified from these stable transfectants 
using a metal affinity lesin (eg. Qiagen or Clontech) for assay of its bioactivity and efficacy as 

15 a therapeutic reagent. 

EXAMPLE 7 

HELA2 (TESTISIN) IS SPECIFICALLY EXPRESSED IN 
THE NORMAL TESTIS, AND IS ASSOCIATED WITH SPERM DEVELOPMENT 

20 

(A) Normal Tissue Blot 

Dot blot analysis of PolyA+ RNA from 50 normal tissue specimens (strandardised to 8 different 
housekeeping genes) (Clontech) was performed using a 32P-labelled HELA2 (testisin) probe. 
25 Hybridization of tiie radiolabelled probe was in ExpressHyb solution (Clontech) at 65o. The 
blots were washed to a fmal stringency of O.lxSSC/0.5% w/v SDS. High level expression of 
HELA2 (testisin) was found only in the testis as shown by the histogram plot of the Signal 
Intensity in Figure 9. In contrast, probing of the same blot with BCON3 showed ubiquitous 
expression of BC0N3 mRNA in a variety of tissues (Figure 9). 

30 



(B) Multiple Tissue Northern Blot 
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Northern blots displaying polyA+ mRNA from 16 different normal tissues (Clontech) were 
hybridised at 65°C in ExpressHyb solution using a 400bp SacII/EcoRI 32P-labelIed HELA2 
probe for 3h and then washed to a final stringency of 0. lxSSC/0. 1 %SDS at 60^C. After a 5h 
exposure, a strong band was observed only in the lane containing testis niRN A, demonstrating 
5 the specificity of HELA2 (testisin) expression for the testis (Figure lOA). Prolonged exposure 
(4.5 days) of the blot revealed a very low level of HELA2 (testisin) mRNA expression in the 
prostate, lung and pancreas only. In contrast to HELA2. BCON3 is expressed in mRNA from 
most tissues present on the blot (Figure lOB). 

10 

(C) HELA2 (testisin) is Expressed in Sperm Cells, Demonstrating its Germ Cell Origin 

To determine whether HELA2 (testisin) expression is associated with germ cells of the testis, 
ejaculate specimens from normal fertUe males were compared with those of post-vasectomy 
15 males by RT-PCR analysis using HELA2 (testisin) specific primers. Sperm is the primary 
product fix)m the testis that is found in ejaculate; other components of the ejaculate are derived 
from the prostate. 

First strand cDNA was reverse transcribed from total RNA which has been isolated from frozen 
20 or fresh ejaculate specimens. PCR was performed on the cDNA templates using the primers: 
forward: 5^ CTGACTTCCATGCCATCCTT 3' [SEQ ID NO:22] and 
reverse: 5' GCTCACGACTCCAATCTGAT 3' [SEQ ID NO:23]. 

As shown in Figure 11, sU-ong signals of the expected size of 464 bp were detected in ejaculate 
from normal males (Patients #23 and #31), while no HELA2 (testisin) was detected in Patient 
25 #153 (post-vasectomy). Patent #90 (post-vasectomy) showed a low level of amplification 
product which may reflect a small amount of residual sperm in the seminiferous tubules. PCR 
using primers specific for (2-macroglobulin was performed on the same samples as a control 
for the presence of approximately equal amounts of cDNA in each sample. 

30 (D) HELA2 (testisin) is Expressed in Immature Germ Cells of the Testis 
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In situ hybridization was performed on paraffin-embedded specimens of rat testis tissue using 
DIG- labelled HELA2 (testisin) RNA probes (T3 and T7 generated transcripts containing 
nucleotides 1-423 of HELA2 cDNA). The results using the antisense RNA probe showed 
strong positive staining near the basal lamina of the seminiferous tubules in the ,^^ion 
5 associated with spermatocytes *and spermatogonia (Figure 12, see arrows). HELA2 (testisin) 
mRNA expression did not appear to be associated with Leydig cells and the pattern was not 
typical for Sertoli cell staining. The presence of HELA2 (testisin) mRNA in these cells indicates 
a role for HELA2 (testisin) in germ cell maturation and sperm development. 

10 

EXAMPLE 8 

HELA2 (TESTISIN) EXPRESSION IS ASSOCIATED WITH TUMOURS 
IN NON-TESTIS CELL-TYPES 

15 The tissue and cell-type distribution of testisin mRNA transcripts in tumours were determined 
by Northern hybridization analyses of RNA extracted from in vitro cultured tumour cells lines 
derived tcom different cancerous tissues. HELA2 (testisin) was detected in the HeLa ovarian 
carcinoma, the U937 lymphoma, and the melanoma cell line 253-3D. HELA2 (testisin) is also 
associated with cDNA libraries derived from tumours of tte colon, pancreas, prostate and ovary 

20 (NCBI-EST Database). The presence of HELA2 (testisin) in tumours where it is not expressed 
normally indicates that it likely plays a role in tumourigenesis in several cell-types. 

EXAMPLE 9 
THE HELA2 (TESTISIN) GENE IS LOCATED 
25 ON HUMAN CHROMOSOME 16pl33 

The genetic location of testisin was mapped to the short arm of chromosome 16 at 16pl3.3 by 
fluorescence in-situ hybridization to normal metaphase chromosomes (Figure 13A). Screening 
of a chromosome 16 hybrid panel then sub-localised HELA2 (testisin) to the cosmid 406D6 
30 which has been mapped to this region (Sood, R. et al (1997)Genomics 42: 83-95; Doggett, 
N.A. et al. (1995) Nature 377 (SuppL):335-365. The cosmid lies between the markers 
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D16S246 and D16S468 and the gene is located just centromeric to D16S246 (Figure 13B). 
This region of the human genome is associated with high genetic instability and telomeric 
rearrangements underlie a variety of common human genetic disorders. Testisin is sandwiched 
between the human disease genes PKDl (polycystic kidney disease) and tuberous sclarpsis 
5 (TSC2) on the on side, and MEF (familial mediterranean fever) and Rubenstein-Taybi syndrome 
(RSTS) on the other side as diagrammed in Figure 13B. 

EXAMPLE 10 

HELA2 (TESTISIN) mRNA AND PROTEIN EXPRESSION 
10 IS ABSENT IN TESTICULAR GERM CELL TUMOURS 

To determine whether HELA2 (testisin) may play a role in testicular tumourigenesis, HELA2 
(testisin) mRNA expression in normal testes and testicular tumour tissue obtained from 4 
patients diagnosed with seminoma were compared by Northern blot analysis. HELA2 (testisin) 
15 mRNA was detected in nomial testes from all four patients but was not detectable in the 
corresponding tumours (Figure 14A). This data indicates a tumour suppressor role for HELA2 
(testisin) in testicular germ cell tumours. 

Expression of HELA2 (testisin) protein in testicular tissue was exaniined by 
20 immunohistochemistry. Paraffin-embedded tissue sections were fixed, treated, blocked, 
incubated with anti-peptide antibodies (1:10 dilution) and bound antibody detected wifli the 
Vectastain Universal Elite ABC kit (Vector Laboratories). Negative controls were performed 
in the absence of antibody. Strong staining of HELA2 (testisin) was detected in the germ cells 
of normal testis (N) but was absent in the adjacent tumour tissue (T) (for example, see Figure 
25 14B), providing further evidence of a tumour suppressor role for HELA2 (testisin) in testicular 
germ cell tumours. 

EXAMPLE 11 

GENOMIC ORGANISATION OF THE HELA2 (TESTISIN) GENE 

30 

The HELA2 (testisin) gene is further characterised by determination of its genomic 
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organisation. Intron-extron boundaries and most of the DNA sequence of tiie HELA2 (testisin) 
gene was determined from cosmid DNA by DNA sequencing. A genomic map of HELA2 
(testisin) is given in Figure 15. The intron/exon boundaries are highly conserved relative to 
prostasin, although the sizes of the introns show considerable variation. The genomic DNA 
5 sequence with introns in tawer case and exons in upper case is shown in Figure 16 and in SEQ^^r^ 
ID NO 25. DNA sequence analysis is being performed on RNA from tumour tissues to 
ascertain the predicted function of HELA2 (testisin) as a tumour suppressor. " 

EXAMPLE 12 

10 THE HELA2 (TESTISIN) SHORT AND LONG ISOFORMS 

ARE GENERATED BY ALTERNATIVE mRNA SPLICING 

Two isoforms of HELA2 (testisin) were identified which differ by an insertion of 2 amino acids 
(Tyr-Ser) between the catalytic His and Asp residues. These constitute the long (L) and short 

15 (S) isoforms. At the DNA level there is a corresponding insertion of 6 nucleotides which 
generates a Sfcl restriction enzyme site. PCR amplification fix)m single strand cDNA generated 
from HeLa cell total RNA followed by DNA sequence analysis of the amplified product 
demonstrated that the two isoforms are generated through the use of two alternative mRNA 
splice sites. The DNA sequence for the intron and the flanking exons arc shown in Figure 17. 

20 The resulting insertion of amino acids YS occurs 4 amino acids after the catalytic His residue 
of HELA2 (testisin). Preliminary molecular modelling shows the presence of tliis insertion is 
likely to alter the catalytic activity and/or specificity of HELA2 (testisin) for its substrates. 

EXAMPLE 13 

25 MUTATION ANALYSIS-HELA2 (TESTISIN) AS A TUMOUR SUPPRESSOR 

Intronic DNA sequence information generated above (see Example 1 1) is used to generate " 
primers to amplify HELA2 (testisin) exons for SSCP analyses. Genomic DNA isolated from 
seminomas and corresponding normal testis as well as genomic DNA from wild-type and 
30 affected seminoma family members are analysed by SSCP for altered expression patterns 
indicative of genetic mutations. Evidence of genetic mutations are also being determined by 
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DNA sequence analysis. 



EXAMPLE 14 
HOMOLOGUES OF HUMAN HELA2 (TESTISIN) 
5 ARE PRESENT IN OTHER SPECIES 



Southern blot analysis of genomic DNA isolated firom a range of species using a HELA2 
(testisin) cDNA probe shows that homologues of HELA2 (testisin) are present in hamster, 
mouse, marmoset and monkey. The mouse homologue of HELA2 (testisin) was identified and 

10 obtained as an EST clone. The cDNA sequence and corresponding amino acid sequence of 
mouse HELA2 (testisin) was determined (Figure 18) and is given in SEQ ID NO 27. The 
mouse cDNA encodes a protein which contains the catalytic triad of His, Asp and Ser (circles) 
and 10 cysteine residues (small boxes), and an activation site (triangle) as found in HELA2 
(testisin). The hydrophilicity plot shows the presence of a hydrophobic sequence at the carboxy 

15 terminus suggesting the presence of a putative membrane anchor. Comparison of the mouse and 
human sequences show 68.1% homology at the cDNA level and 69.1% homology at the amino 
acid level. 



EXAMPLE 15 

20 HELA2 (TESTISIN) IS PART OF A CLUSTER OF 

HOMOLOGOUS GENES ON CHROMOSOME 16pl3.3 



Analysis of DNA sequences released to NCBI databases reveals the presence of homologues 
of HELA2 (testisin) in a cluster on Chromosome 16pl3.3. Figure 19 shows the positions of 

25 these genes, designated SPOOILA, SP002LA, SP003LA, and SP004LA, relative to HELA2 
(testisin) and the respective cosmids (Sood, R. et al (1997) Genomics 42: 83-95) in which they 
are located. Figure 20A, 20B and 20C show the partial cDNA and deduced amino acid 
sequences of SP(X)1LA, SP002LA, and SP003LA respectively. Each cDNA encodes a protein 
which contains the catalytic triad of His, Asp and Ser (circles) and 10 cysteine residues (small 

30 boxes), and an activation site (triangle) as found in HELA2 (testisin). Comparisons of the 
cDN A and amino acid sequences from the heavy chain region through to the poly A tail gives 
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the % identity with HELA2 (testisin) as foUows: 





cDNA 


Protein 


SPOOILA 


34.8% 


47.3% 


SP002LA 


41.0% 


47.1% 


SP003LA 


40.3% 


51.3% 




• ••• — -.i-j.sji^- 



Each of the serine proteinases encoded by these genes show that they have carboxy terminal 
extensions, and SP002LA is the only one with a hydrophobic carboxy tenninal tail indicative 

10 of a membrane anchored protein. Identification of an expressed sequence tag (EST) from a 
human testis cDNA Ubrary demonstrates that this gene is expressed in the testis, like HELA2 
(testisin). The location of this serine proteinase cluster on chromosome 16pl3.3 flanking 
HELA2 (testisin) suggests that these serine proteinases are also involved, like HELA2(testisin), 
in sperm mamration and development. Thus they may constitute a proteolytic cascade which 

15 is essential for these processes. Loss or mutation of these genes may lead to testicular germ cell 
tumours and to other testicular abnormalities, such as infertility. 



EXAMPLE 16 
ATC2 SERINE PROTEINASE 

20 

ATC2 was isolated from the cDNA of PAI-2 expressing HeLa cells following treatment with 
TNF and cycloheximkle. A partial DNA sequence for ATC2 cDNA has been obtained which 
encompasses the sequence encoding the serine proteinase catalytic region. Additional clones 
extending to both 5" and 3' directions have been obtained. The available nucleic acid sequence 

25 of ATC2 cDNA and its deduced amino acid sequence shows that it is a member of the serine 
proteinase femily with homology to hcpsin. prostasin. and acrosin. It thus belongs to the same 
family as HELA2. The catalytic region includes the His, Asp and Ser conserved motifs. 
Preliminary Northern blot experiments have foiled to detect ATC2 mRNA in total RNA isolated 
from resting HcLa cells, indicating it is not expressed in abundance in these ceUs, which noay 

30 therefore be tightly regulated. As ATC2 was isolated firom cells foUowing treatment with TNF 
and cycloheximide, its expression may be induced by these agents in HeU ceUs. These data 
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have potential significance for a role for ATC2 in apoptosis and cell death. ATC2 may be 
intracellular, extracellular or found on the cell surface and is likely to be involved in regulating 
cell functions. Thus ATC2 may have potential significance in the treatment of cancer and 
diseases involving dysregulation of cell growth and survival. The nucleotide and corresponding 
5 amino acid sequence of ATC2 is shown in SEQ ID NOs: 7 and 8, respectively. 

EXAMPLE 16 
BC0N3 

10 

The deduced amino acid sequence of BC0N3 (SEQ ID NO: 10) reveals that it is novel. At both 
the DNA and protein level, BCON3 shows homology to members of the kinase family of 
proteins. Although it cannot be classified as a member of any particular sub-family of kinases, 
alignments of the BCON3 protein with the conserved domains of thymidine kinases and tyrosine 

1 5 and serine/threonine protein kinases indicates possible ATP/GTP binding and phosphate transfer 
regions. Thus, it may be the first member of a new family of kinases. Analysis of the translation 
product using hydrophobicity plots and the Prosite protein analysis algorithms indicates BCON3 
may lack an N-terminal signal sequence (that is, it is likely to encode an intracellular protein) 
and it possesses a nuclear localization signal. BC0N3 mRNA is approxunately 2300 

20 nucleotides in length. cDNA sequence (SEQ ID N0:9) has been obtained covering about 95% 
of the transcript and including the 3' poly A tail BC0N3 mRNA is expressed in most normal 
tissues as demonstrated by dot blot analysis of 50 normal tissue specimens (standardised to 8 
different housekeeping genes) (Clontech). (Figure 9). Analysis of BCON3 mRNA expression 
using a multiple tissue Northern blot displaying polyA+ mRNA from 16 different normal tissues 

25 (Clontech) shows that BCON3 is expressed in most tissues (Figure lOB). Expression by in 
vitro transcription/translation expression using a partial BC0N3 cDNA firagment shows 
BC0N3 encodes a protein. Two major transcription/translation products are detected, one of 
51kDa, the size predated from the open reading frame, and a second product of about 43kDa, 
which may represent a partial translation product (Figure 21). 

30 

Those skilled in the art will appreciate that the invention described herein is susceptible to 
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variations and modifications other than those specifically described. It is to be understood that 
the invention includes all such variations and modifications. The invention also includes all of 
the steps, features, compositions and compounds referred to or indicated in this specification, 
individually or collectively, and any and aU combinations of any two or more of said steps or 
5 features. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT 

(US only): ANTALIS Toai Marie and HOOPER John David 
(Other than US): AMRAD OPERATIONS PTY LTD 

(ii) TITLE OF INVENTION: NOVEL MOLECULES 

(iii) NUMBER OF SEQUENCES: 24 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: DA VIES COLLISONCAVE 

(B) STREET: 1 LITTLE COLLINS STREET 

(C) CITY: MELBOURNE 

(D) STATE: VICTORIA 

(E) COUNTRY: AUSTRALIA 

(F) ZIP: 3000 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: International PCT Application 

(B) FILING DATE: 13-FEB-1998 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: P05 101/97 

(B) FILING DATE: 13-FEB-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PP0422/97 

(B) FILING DATE: 18-NOV-1997 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 
(A) NAME: HUGHES, DR E JOHN L 

(C) REFERENCE/DOCKET NUMBER: EJH/AF 
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(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: +6 1 3 9254 2777 
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(2) INFORMATION FOR SEQ ID NO:li ' ' \- 

|i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs . / 

(B) TYPE: nucleic acid . • - 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

32 

ACAGAATTCT GGGTIGTIAC IGCIGCICAY TG 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1094 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
ACAGAATTCA XIGGICCICC IC/GT/AXTCICC 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1094 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

( ix) FEATURE : 

(A) NAME/KEY: CDS 

IB) LOCATION: 17.. 965 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

CGCGGGAGAG GAGGCC ATG GGC GCG CGC GGG GCG CTG CTG CTG GCG CTG 
Met Gly Ala Arg Gly Ala Leu Leu Leu Ala Leu 
1 5 10 

CTG CTG GOT CGG GOT GGA CTC AGG AAG CCG GAG TCG CAG GAG GCG GCG 
Leu Leu Ala Arg Ala Gly Leu Arg Lys Pro Glu Ser Gin Glu Ala Ala 
15 20 25 

CCG TTA TCA GGA CCA TGC GGC CGA CGG GTC ATC ACG TCG CGC ATC GTO 
Pro Leu ser Gly Pro Cys Gly Arg Arg Val He Thr Ser Arg He val 
30 35 40 

QGT GGA GAG GAC GCC GAA CTC GGG CGT TGG CCG TGG CAG GGG AGC CTG 
Gly Gly Glu Asp Ala Glu Leu Gly Arg Trp Pro Trp Gin Gly Ser Leu 
45 50 55 

CGC CTG TGG GAT TCC CAC GTA TGC GGA GTG AGC CTG CTC AGC CAC CGC 
Arg Leu Trp Asp Ser His Val Cys Gly Val Ser Leu Leu Ser His Arg 
60 65 70 75 

TGG GCA CTC ACG GCG GCG CAC TCC TTT GAA ACT GAC CTT AGT GAT CCC 
Trp Ala Leu Thr Ala Ala His Cys Phe Glu Thr Asp Leu Ser Asp Pro 
80 85 90 

TCC GGG TGG ATC GTC CAG TTT GGC CAG CTG ACT TCC ATG CCA TCC TTC 
ser Gly Trp Met Val Gin Phe Gly Gin Leu Thr Ser Met Pro Ser Phe 
95 100 105 

TGG AGC CTC CAG GCC TAC TAG ACC CGT TAC TTC GTA TCG AAT ATC TAT 
Trp Ser Leu Qln Ala Tyr Tyr Thr Arg Tyr Phe Val Ser Asn He Tyr 
110 115 120 

CTC AGC CCT CGC TAC CTC GGG AAT TCA CCC TAT GAC ATT GCC TTC GTC 
Leu ser Pro Arg Tyr Leu Gly Asn Ser Pro Tyr Asp He Ala Leu Val 
125 130 135 

AAG CTC TCT GCA CCT GTC ACC TAC ACT AAA CAC ATC CAG CCC ATC TCT 
Lys Leu Ser Ala Pro Val Thr Tyr Thr Lys His He Gin Pro He Cys 
140 145 150 155 

CTC CAG GCC TCC ACA TTT GAG TTT GAG AAC CGG ACA GAC TCC TCG GTC 
Leu Gin Ala Ser Thr Phe Glu Phe Glu Asn Arg Thr Asp Cys Trp Val 
160 165 170 , 



49 



97 



145 



193 



241 



289 



337 



385 



433 



481 



529 
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ACT GGC TGG GGG TAC ATC AAA GAG GAT GAG GCA CTG CCA TCT CCC CAC 
Thr Gly Trp Gly Tyr He hys Glu Asp Glu Ala Leu Pro Ser Pro His 
175 180 185 

ACC CTC CAG GAA GTT CAG GTC GCC ATC ATA AAC AAC TCT ATG TGC AAC 
Thr Leu Gin Glu Val Gin Val Ala He He Asn Asn Ser Met Cys Asn 
190 195 200 

CAC CTC TTC CTC AAG TAC AGT TTC CGC AAG GAC ATC TTT GGA GAC ATG 
His Leu Phu Leu Lys Tyr Ser Phe Arg Lys Asp He Phe Gly Asp Met 
205 210 215 

GTT TGT GCT GGC AAT GCC CAA GGC GGG AAG GAT GCC TGC TTC GGT GAC 
Val Cys Ala Gly Asn Ala Gin Gly Gly Lys Asp Ala Cys Phe Gly Asp 
220 225 230 235 

TCA GGT GGA CCC TTG GCC TGT AAC AAG GAT GGA CTG TGG TAT CAG ATT 
Ser Gly Gly Pro Leu Ala Cys Asn Lys Asp Gly Leu Trp Tyr Gin He 
240 245 250 

GGA GTC GTG AGC TGG GGA GTG GGC TGT GGT CGG CCC AAT CGG CCC GGT 
Gly Val Val Ser Trp Gly Val Gly Cys Gly Arg Pro Asn Arg Pro Gly 
255 260 265 

GTC TAC ACC AAT ATC AGC CAC CAC TTT GAG TGG ATC CAG AAG CTG ATG 
Val Tyr Thr Asn He Ser His His Phe Glu Trp He Gin Lys Leu Met 
270 275 280 

GCC CAG AGT GGC ATG TCC CAG CCA GAC CCC TCC TGG CCG CTA CTC TTT 
Ala Gin Ser Gly Met Ser Gin Pro Asp Pro Ser Trp Pro Leu Leu Phe 
285 290 295 

TTC CCT CTT CTC TGG GCT CTC CCA CTC CTG GGG CCG GTC TGA 
Phe Pro Leu Leu Trp Ala Leu Pro Leu Leu Gly Pro Val * 
300 305 310 

GCCTACCTGA GCCCATGCAG CCTGGGGCCA CTGCCAAGTC AGGCCCTGGT TCTCTTCTGT 
CTTGTTTGGT AATAAACACA TTCCAGTTGA TGCCTTGCAG GGCATTTTTC AAAAAAAAAA 
AAAAAAAAAA AAAAAAAAA 



577 

625 

673 

.721 

769 

817 

865 

913 

961 

1015 
1075 
1094 



(2) INFORMATION FOR SEQ ID NO: 4: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 313 amino acids 

(B) TYPE; amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Gly Ala Arg Gly Ala Leu Leu Leu Ala Leu Leu Leu Ala Arg Ala 
1 5 10 15 

Gly Leu Arg Lys Pro Glu Ser Gin Glu Ala Ala Pro Leu Ser Gly Pro 
20 25 30 

Cys Gly Arg Arg Val lie Thr Ser Arg lie Val Gly Gly Glu Asp Ala 
35 40 45 

Glu Leu Gly Arg Trp Pro Trp Gin Gly Ser Leu Arg Leu Trp Asp Ser 
50 55 60 

His Val Cys Gly Val Ser Leu Leu Ser His Arg Trp Ala Leu Thr Ala 
65 70 75 80 

Ala His Cys Phe Glu Thr Asp Leu Ser Asp Pro Ser Gly Trp Met Val 
85 90 95 

Gin Phe Gly Gin Leu Thr Ser Met Pro Ser Phe Trp Ser Leu Gin Ala 
100 105 110 

Tyr Tyr Thr Arg Tyr Phe Val Ser Asn lie Tyr Leu Ser Pro Arg Tyr 
115 120 125 

Leu Gly Asn Ser Pro Tyr Asp lie Ala Leu Val Lys Leu Ser Ala Pro 
130 135 140 

Val Thr Tyr Thr Lys His lie Gin Pro lie Cys Leu Gin Ala Ser Thr 
145 150 155 160 

Phe Glu Phe Glu Asn Arg Thr Asp Cys Trp Val Thr Gly Trp Gly Tyr 
165 170 175 



lie Lys Glu Asp Glu Ala Leu Pro Ser Pro His Thr Leu Gin Glu Val 
180 185 190 
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Gin Val Ala He He Asn Asn Ser Met Cys Asn His Leu Phe Leu Lys 
195 200 205 

Tyr Ser Phe Arg Lys Asp He Phe Gly Asp Met Val Cys Ala Gly Asn 
210 215 220 

Ala Gin Gly Gly Lys Asp Ala Cys Phe Gly Asp Ser Gly Gly Pro Leu 
225 230 235 240 

Ala Cys Asn Lys Asp Gly Leu Trp Tyr Gin He Gly Val Val Ser Trp 
245 250 255 

Gly Val Gly Cys Gly Arg Pro Asn Arg Pro Gly Val Tyr I'hr Asn He 
260 265 270 

Ser His His Phe Glu Trp He Gin Lys Leu Met Ala Gin Ser Gly Met 
275 280 285 

Ser Gin Pro Asp Pro Ser Trp Pro Leu Leu Phe Phe Pro Leu Leu Trp 
290 295 300 

Ala Leu Pro Leu Leu Gly Pro Val * 
305 310 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1100 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 17.. 961 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

CGCGGGAGAG GAGGCC ATG GGC GCG CGC GGG GCG CTG CTG CTG GCG CTG 
Met Gly Ala Arg Gly Ala Leu Leu Leu Ala Leu 
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CTG CTG GCT CGG GCT GGA CTC AGG AAG CCG GAG TCG CAG GAG GCG GCG 
Leu Leu Ala Arg Ala Gly Leu Arg Lys Pro Glu Ser Gin Glu Ala Ala 
15 20 25 



97 



CCG TTA TCA GGA CCA TGC GGC CGA CGG GTC ATC ACG TCG CGC ATC GTG 145 
Pro Leu Ser Gly Pro Cys Gly Arg Arg Val He Thr Ser Arg He Val 
30 35 40 



GGT GGA GAG GAG GCC GAA CTC GGG CGT TGG CCG TGG CAG GGG AGC CTG 193 
Gly Gly Glu Asp Ala Glu Leu Gly Arg Trp Pro Trp Gin Gly Ser Leu 
45 50 55 



CGC CTG TGG GAT TCC CAC GTA TGC GGA GTG AGC CTG CTC AGC CAC CGC 241 
Arg Leu Trp Asp Ser His Val Cys Gly Val Ser Leu Leu Ser His Arg 
60 65 70 75 



TGG GCA CTC ACG GCG GCG CAC TGC TTT GAA ACC TAT AGT GAC CTT AGT 289 
Trp Ala Leu Thr Ala Ala His Cys Phe Glu Thr Tyr Ser Asp Leu Ser 
80 85 90 



GAT CCC TCC GGG TGG ATG GTC CAG TTT GGC CAG CTG ACT TCC ATG CCA 337 
Asp Pro Ser Gly Trp Met Val Gin Phe Gly Gin Leu Thr Ser Met Pro 
95 100 105 



TCC TTC TGG AGC CTG CAG 
Ser Phe Trp Ser Leu Gin 
110 

ATC TAT CTG AGC CCT CGC 
He Tyr Leu Ser Pro Arg 
125 

TTG GTG AAG CTG TCT GCA 
Leu Val Lys Leu Ser Ala 
140 145 



GCC TAC TAC ACC CGT TAC 
Ala Tyr Tyr Thr Arg Tyr 
115 

TAC CTG GGG AAT TCA CCC 
Tyr Leu Gly Asn Ser Pro 
130 135 

CCT GTC ACC TAC ACT AAA 
Pro Val Thr Tyr Thr Lys 
150 



TTC GTA TCG AAT 385 

Phe Val Ser Asn 

120 

TAT GAC ATT GCC 433 
Tyr Asp He Ala 

CAC ATC CAG CCC 481 
His He Gin Pro 
155 



ATC TGT CTC CAG GCC TCC ACA TTT GAG TTT GAG AAC CGG ACA GAC TGC 529 
He Cys Leu Gin Ala Ser Thr Phe Glu Phe Glu Asn Arg Thr Asp Cys 
160 165 170 

TGG GTG ACT GGC TGG GGG TAC ATC AAA GAG GAT GAG GCA CTG CCA TCT 577 
Trp Val Thr Gly Trp Gly Tyr He Lys Glu Asp Glu Ala Leu Pro Ser 
175 180 185 
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CCC CAC ACC CTC CAG GAA GTT CAG GTC GCC ATC ATA AAC AAC TCT ATG 625 
Pro His Thr Leu Gin Glu Val Gin Val Ala lie lie Asn Asn Ser Met 
190 195 200 

TGC AAC CAC CTC TTC CTC AAG TAC AGT TTC CGC AAG GAC ATC TTT GGA 673 
Cys Asn His Leu Phe Leu Lys Tyr Ser Phe Arg Lys Asp li.e Phe Gly 
205 210 215 

GAC ATG GTT TGT GCT GGC AAT GCC CAA GGC GGG AAG GAT GCC TGC TTC 721 
Asp Met Val Cys Ala Gly Asn Ala Gin Gly Gly Lys Asp Ala Cys Phe 
220 225 230 235 

GOT GAC TCA GGT GGA CCC TTG GCC TGT AAC AAG GAT GGA CTG TGG TAT 769 
Gly Asp ser Gly Gly Pro Leu Ala Cys Asn Lys Asp Gly Leu Trp Tyr 
240 245 250 

CAG ATT GGA GTC GTG AGC TGG GGA GTG GGC TGT GGT CGG CCC AAT CGG 817 
Gin lie Gly Val Val Ser Trp Gly Val Gly Cys Gly Arg Pro Asn Arg 
255 260 265 

CCC GGT GTC TAC ACC AAT ATC AGC CAC CAC TTT GAG TGG ATC CAG AAG 865 
Pro Gly Val Tyr Thr Asn lie Ser His His Phe Glu Trp lie Gin Lys 
270 275 280 

CTG ATG GCC CAG AGT GGC ATG TCC CAG CCA GAC CCC TCC TGG CCG CTA 913 
Leu Met Ala Gin Ser Gly Met Ser Gin Pro Asp Pro Ser Trp Pro Leu 
285 290 295 

CTC TTT TTC CCT CTT CTC TGG GCT CTC CCA CTC CTG GGG CCG GTC TGAGCCTACC 
968 

Leu Phe Phe Pro Leu Leu Trp Ala Leu Pro Leu Leu Gly Pro Val 
300 305 310 315 

TGAGCCCATG CAGCCTGGGG CCACTGCCAA GTCAGGCCCT GGTTCTCTTC TGTCTTGTTT 1028 
GGTAATAAAC ACATTCCAGT TGATGCCTTG CAGGGCATTT TTCAAAAAAA AAAAAAAAAA 1088 
AAAAAAAAAA AA ^^^^ 



(2) INFORMATION FOR SEQ ID NO: 6; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 314 «imino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 

Met Gly Ala Arg Gly Ala Leu Leu Leu Ala Leu Leu Leu Ala Arg Ala 
15 10 15 

Gly Leu Arg Lys Pro Glu Ser Gin Glu Ala Ala Pro Leu Ser Gly Pro 
20 25 30 

Cys Gly Arg Arg Val He Thr Ser Arg He Val Gly Gly Glu Asp Ala 
35 40 45 

Glu Leu Gly Arg Trp Pro Trp Gin Gly Ser Leu Arg Leu Trp Asp Ser 
50 55 60 

His val Cys Gly Val Ser Leu Leu Ser His Arg Trp Ala Leu Thr Ala 
65 70 75 80 

Ala His Cys Phe Glu Thr Tyr Ser Asp Leu Ser Asp Pro Ser Gly Trp 
85 90 95 

Met Val Gin Phe Gly Gin Leu Thr Ser Met Pro Ser Phe Trp Ser Leu 
100 105 110 

Gin Ala Tyr Tyr Thr Arg Tyr Phe Val Ser Asn He Tyr Leu Ser Pro 
115 120 125 

Arg Tyr Leu Gly Asn Ser Pro Tyr Asp He Ala Leu Val Lys Leu Ser 
130 135 140 



Ala Pro Val Thr Tyr Thr Lys His 
145 150 

Ser Thr Phe Glu Phe Glu Asn Arg 
165 

Gly Tyr He Lys Glu Asp Glu Ala 
180 



He Gin Pro He Cys Leu Gin Ala 
155 160 

Thr Asp Cys Trp Val Thr Gly Trp 
170 175 

Leu Pro Ser Pro His Thr Leu Gin 
185 190 



Glu Val Gin Val Ala He He Asn Asn Ser Met Cys Asn His Leu Phe 
195 200 205 
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Leu Lys Tyr Ser Phe Arg Lys Asp lie Phe Gly Asp Met Val Cys Ala 
210 215 220 

Gly Asn Ala Gin Gly Gly Lys Asp Ala Cys Phe Gly Asp Ser Gly Gly 
225 230 235 240 

Pro Leu Ala Cys Asn Lys Asp Gly Leu Trp Tyr Gin He Gly Val Val 
245 250 255 

Ser Trp Gly Val Gly Cys Gly Arg Pro Asn Arg Pro Gly Val Tyr Thr 
260 265 270 

Asn He Ser His His Phe Glu Trp He Gin Lys Leu Met Ala Gin Ser 
275 280 285 

Gly Met Ser Gin Pro Asp Pro Ser Trp Pro Leu Leu Phe Phe Pro Leu 
290 295 300 



Leu Trp Ala Leu Pro Leu Leu Gly Pro Val 
305 310 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 799 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 24.. 799 



Ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

AGTTCAGATG AATGGGACTG TGA GAA CCA TCT GTG ACC AAA TTG ATA CAG 50 

Glu Pro Ser Val Thr Lys Leu He Gin 
1 5 



GAA CAG GAG AAA GAG CCG CGG TGG CTG ACA TTA CAC TCC AAC 'TGG GAG 



98 
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Glu Gin Glu Lys Glu Pro Arg Trp Leu Thr Leu His Ser Asn Trp Glu 
10 15 20 25 . . 

AGC CTC AAT GGG ACC ACT TTA CAT GAA CTT GTA GTA AAT GGG GAG TCT 146 
Ser Leu Asn Gly Thr Thr Leu His Glu Leu Val Val Asn Gly Gin Ser 
30 35 40 

TGT GAG AGC AGA AGT AAA ATT TCT CTT CTG TGT ACT AAA CAA GAC TGT 194 
Cys Glu Ser Arg Ser Lys lie Ser Leu Leu Cys Thr Lys Gin Asp Cys 
45 50 55 

GGG CGC CGC CCT GCT GCC CGA ATG AAC AAA AGG ATC CTT GGA GGT CGG 242 
Gly Arg Arg Pro Ala Ala Arg Met Asn Lys Arg He Leu Gly Gly Arg 
60 65 70 

ACG AGT CGC CCT GGA AGG TGG CCA TGG CAG TGT TCT CTG CAG AGT GAA 290 
Thr Ser Arg Pro Gly Arg Trp Pro Trp Gin Cys Ser Leu Gin Ser Glu 
75 80 85 

CCC AGT GGA CAT ATC TGT GGC TGT GTC CTC ATT GCC AAG AAG TGG GTT 338 
Pro Ser Gly His He Cys Gly Cys Val Leu He Ala Lys Lys Trp Val 
90 95 100 105 

GTG AC A GTT GCC CAC TGC TTC GAG GGG AGA GAG AAT GCT GCA GTT TGG 3 86 

Val Thr Val Ala His Cys Phe Glu Gly Arg Glu Asn Ala Ala Val Trp 
110 115 120 

AAA GTG GTG CTT GGC ATC AAC AAT CTA GAC CAT CCA TCA GTG TTC ATG 434 
Lys val Val Leu Gly He Asn Asn Leu Asp His Pro Ser Val The Met 
125 130 135 

CAG AC A CGC TTT GTG AGG ACC ATC ATC CTG CAT CCC CGC TAC AGT CGA 482 
Gin Thr Arg Phe Val Arg Thr He He Leu His Pro Arg Tyr Ser Arg 
140 145 150 

GCA GTG GTG GAC TAT GAC ATC AGC ATC GTT GAG CTG AGT GAA GAC ATC 530 
Ala Val Val Asp Tyr Asp He Ser He Val Glu Leu Ser Glu Asp He 
155 160 165 

AGT GAG ACT GGC TAC GTC CGG CCT GTC TGC TTG CCC AAC CCG GAG CAG 578 
Ser Glu Thr Gly Tyr Val Arg Pro Val Cys Leu Pro Asn Pro Glu Gin 
170 175 180 185 

TGG CTA GAG CCT GAC ACG TAC TGC TAT ATC ACA GGC TGG GGC CAC ATG 626 
Trp Leu Glu Pro Asp Thr Tyr Cys Tyr He Thr Gly Trp Gly His Met 
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190 



195 200 



GGC AAT AAA ATG CCA TTT AAG CTG CAA GAG GGA GAG GTC CGC ATT ATT 674 

Gly Asn Lys Met Pro Phe Lys Leu Gin Glu Gly Glu Val Arg He He 
205 210 215 

TCT CTG GAA CAT TGT CAG TCC TAC TTT GAC ATG AAG ACC ATC ACC ACT 722 

Ser Leu Glu His Cys Gin Ser Tyr Phe Asp Met Lys Thr He Thr Thr 
220 225 230 

CGG ATG ATA TGT GCT GGC TAT GAG TCT GGC ACA GTT GAT TCA TGC ATG 770 

Arg Met He Cys Ala Gly Tyr Glu Ser Gly Thr Val Asp Ser Cys Met 
235 240 245 



GGT GAC TGG GGC GGT CCG TTG AAT TCT GT 
Gly Asp Trp Gly Gly Pro Leu Asn Ser 
250 255 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 258 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO . 8 : 

Glu Pro Ser Val Thr Lys Leu He Gin Glu Gin Glu Lys Glu Pro Arg 
15 10 15 

Trp Leu Thr Leu His Ser Asn Trp Glu Ser Leu Asn Gly Thr Thr Leu 
20 25 30 

His Glu Leu Val Val Asn Gly Gin Ser Cys Glu Ser Arg Ser Lys He 
35 40 45 

Ser Leu Leu Cys Thr Lys Gin Asp Cys Gly Arg Arg Pro Ala Ala Arg 
50 55 60 

Met Asn Lys Arg He Leu Gly Gly Arg Thr Ser Arg Pro Gly Arg Trp 
65 70 75 80 



799 
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Pro Trp Gin Cys Ser Leu Gin Ser Glu Pro Ser Gly His lie Cys Gly 
85 90 95 

Cys Val Leu lie Ala Lys Lys Trp Val Val Thr Val Ala His Cys Phe 
100 105 110 

Glu Gly Arg Glu Asn Ala Ala Val Trp Lys Val Val Leu Gly He Asn 
115 120 125 

Asn Leu Asp His Pro Ser Val Phe Met Gin Thr Arg Phe Val Arg Thr 
130 135 140 

He He Leu His Pro Arg Tyr Ser Arg Ala Val Val Asp Tyr Asp He 
145 150 155 160 

Ser He Val Glu Leu Ser Glu Asp He Ser Glu Thr Gly Tyr Val Arg 
165 170 175 

Pro Val Cys Leu Pro Asn Pro Glu Gin Trp Leu Glu Pro Asp Thr Tyr 
180 185 190 

Cys Tyr He Thr Gly Trp Gly His Met Gly Asn Lys Met Pro Phe Lys 
195 200 205 

Leu Gin Glu Gly Glu Val Arg He He Ser Leu Glu His Cys Gin Ser 
210 215 220 

Tyr Phe Asp Met Lys Thr He Thr Thr Arg Met He Cys Ala Gly Tyr 
225 230 235 240 

Glu Ser Gly Thr Val Asp Ser Cys Met Gly Asp Trp Gly Gly Pro Leu 
245 250 255 

Asn Ser 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2241 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 166.. 1773 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

ATTTAATACG ACTCACTATA GGGAATTTGG CCCTCGAGGA AGAATTCGGC ACGAGGCTGC 

GGCGCACTGT GAGGGAGTCG CTGTGATCCG GGGCCCCGAA CCCGACTGGA GCTGAAGCGC 

AGGCTGCGGG GCGCGGAGTC GGGAGGCCTG AGTGTTCCTT CCAGC ATG TCG GAG 

Met Ser Glu 
1 

GGG GAG TCC CAG ACA GTA CTT AGC AGT GGC TCA GAC CCA AAG GTA GAA 
Gly Glu Ser Gin Thr Val Leu Ser Ser Gly Ser Asp Pro Lys Val Glu 
5 10 15 

TCT TCA TCT TCA GCT CCT GGC CTG ACA TCA GTG TCA CCT CCT GTG ACC 270 
Ser Ser Ser Ser Ala Pro Gly Leu Thr Ser Val Ser Pro Pro Val Thr 
20 25 30 35 

TCC ACA ACC TCA GCT GCT TCC CCA GAG GAA GAA GAA GAA AGT GAA GAT 318 
Ser Thr Thr Ser Ala Ala Ser Pro Glu Glu Glu Glu Glu Ser Glu Asp 
40 45 50 



120 



GAG TCT GAH ATT TTG G/JV GAG TCG CCC TGT GGG CGC TGG CAG AAG AGG 366 
Glu Ser Glu He Leu Glu Glu Ser Pro cys Gly Arg Trp Gin Lys Arg 
55 60 65 

CGA GAA GAG GTG AAT CAA CGG AAT GTA CCA GGT ATT GAC AGT GCA TAC 414 
Arg Glu Glu Val Asn Gin Arg Asn Val Pro Gly He Asp Ser Ala Tyr 
70 75 80 

CTG GCC ATG GAT ACA GAG GAA GGT GTA GAG GTT GTG TGG AAT GAG GTA 462 
Leu Ala Met Asp Thr Glu Glu Gly Val Glu Val Val Trp Asn Glu Val 
85 90 95 

CAG TTC TCT GAA CGC AAG AAC TAC AAG CTG CAG GAG GAA AAG GTT TGT 510 
Gin Phe Ser Glu Arg Lys Asn Tyr Lys Leu Gin Glu Glu Lys Val Cys 
100 105 110 115 

GCT GTG TTT GAT AAT TTG ATT CAA TTG GAG CAT CTT AAC ATT GTT AAG 558 
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Ala Val Phe Asp Asn Leu He Gin Leu Glu His Leu Asn He Val Lys 
120 125 130 

TTT CAC AAA TAT TOG GCT GAG ATT AAA GAG AAC AAG GCC AGG GTC ATT 606 
Phe His Lys Tyr Trp Ala Asp He Lys Glu Asn Lys Ala Arg Val He 
135 140 145 

TTT ATC ACA GGA TAG ATG TCA TCT GG6 AGT CTG AAG CAA TTT CTG AAG 654 
Phe He Thr Gly Tyr Met Sex Ser Gly Ser Leu Lys Gin Phe Leu Lys 
150 155 160 

AAG ACC CAA AAG AAC CAC CAG ACG ATG AAT GAA AAG GCA TGG AAG CGT 702 
Lys Thr Gin Lys Asn His Gin Thr Met Asn Glu Lys Ala Trp Lys Arg 
165 170 175 

TGG TGC ACA CAA ATC CTC TCT GCC CTA AGC TAC CTG CAC TCC TGT GAC 750 
Trp Cys Thr Gin He Leu Ser Ala Leu Ser Tyr Leu His Ser Cys Asp 
180 185 190 195 

CCC CCC ATC ATC CAT GGG AAC CTG ACC TGT GAC ACC ATC TTC ATC CAG 798 
Pro Pro He He His Gly Asn Leu Thr Cys Asp Thr He Phe He Gin 
200 205 210 

CAC AAC GGA CTC ATC AAG ATT GGC TCT GTG GCT CCT GAC ACT ATC AAC 846 
His Asn Gly Leu He Lys He Gly Ser Val Ala Pro Asp Thr He Asn 
215 220 225 

AAT CAT GTG AAG ACT TGT CGA GAA GAG CAG AAG AAT CTA CAC TTC TTT 894 
Asn His Val Lys Thr Cys Arg Glu Glu Gin Lys Asn Leu His Phe Phe 
230 235 240 

GCA CCA GAG TAT GGA GAA GTC ACT AAT GTG ACA ACA GCA GTG GAC ATC 942 
Ala Pro Glu Tyr Gly Glu Val Thr Asn Val Thr Thr Ala Val Asp He 
245 250 255 

TAC TCC TTT GGC ATG TGT GCA CTG GGG ATG GCA GTG CTG GAG ATT CAG 990 
Tyr Ser Phe Gly Met Cys Ala Leu Gly Met Ala Val Leu Glu He Gin 
260 265 270 .275 

GGC AAT GGA GAG TCC TCA TAT GTG CCA CAG GAA GCC ATC AGC AGT GCC 1038 
Gly Asn Gly Glu Ser Ser Tyr Val Pro Gin Glu Ala He Ser Ser Ala 
280 285 290 



ATC CAG CTT CTA GAA GAC CCA TTA CAG AGG GAG TTC ATT CAA AAG TGC 
He Gin Leu Leu Glu Asp Pro Leu Gin Arg Glu Phe He Gin Lys Cys 



1086 i" 
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295 300 305 

CTG CAG TCT GAG CCT OCT CGC AGA CCA ACA GCC AGA GAA CTT CTG TTC 
Leu Gin ser Glu Pro Ala Arg Arg Pro Thr Ala Arg Glu Leu Leu Phe 
310 315 320 

CAC CCA GCA TTG TTT GAA GTG CCC TCG CTC AAA CTC CTT GCG GCC CAC 
His pro Ala Leu Phe Glu Val Pro Ser Leu Lys Leu Leu Ala Ala His 
325 330 335 

TGC ATT GTG GGA CAC CAA CAC ATG ATC CCA GAG AAC GCT CTA GAG GAG 
CVS He val Gly His Gin His Met He Pro Glu Asn Ala Leu Glu Glu 
340 345 350 355 

ATC ACC AAA AAC ATO GAT ACT AGT GCC GTA CTG GCT GAA ATC CCT GCA 
He Thr Lys Asn Met Asp Thr Ser Ala Val Leu Ala Glu He Pro Ala 
360 365 370 

GGA CCA GGA AGA GAA CCA GTT CAG ACT TTG TAG TCT CAG TCA CCA GCT 
Gly pro Gly Arg Glu Pro Val Gin Thr Leu Tyr Ser Gin Ser Pro Ala 
375 380 385 

CTG GAA TTA GAT AAA TTC CTT GAA GAT GTC AGG AAT GGG ATC TAT CCT 
Leu Glu Leu Asp Lys Phe Leu Glu Asp Val Arg Asn Gly He Tyr Pro 
390 395 400 

CTG ACA GCC TTT GGG CTG CCT CGG CCC CAG CAG CCA CAG CAG GAG GAG 
Leu Thr Ala Phe Gly Leu Pro Arg Pro Gin Gin Pro Gin Gin Glu Glu 
405 415 

GTC ACA TCA CCT GTC GTG CCC CCC TCT GTC AAG ACT CCG ACA CCT GAA 
val Thr ser Pro Val Val Pro Pro Ser Val Lys Thr Pro Thr Pro Glu 
420 425 430 435 

CCA GCT GAG GTG GAG ACT CGC AAG GTG GTG CTG ATG CAG TGC AAC ATT 
pro Ala Glu val Glu Thr Arg Lys Val Val Leu Met Gin Cys Asn He 
440 445 450 

GAG TCG GTC GAG GAG GGA GTC AAA CAC CAC CTC ACA CTT CTG CTG AAG 
Glu ser Val Glu Glu Gly Val Lys His His Leu Thr Leu Leu Leu Lys 
455 460 465 

TTG GAG GAC AAA CTC AAC CGG CAC CTC AGC TCT GAC CTC ATC CCA AAT 
Leu Glu Asp Lys Leu Asn Arg His Leu Ser Cys Asp Leu Met Pro Asn 
47b 475 480^^ 



1134 



1182 



1230 



1278 



1326 



1374 



1422 



1470 



1518 



1566 



1614 
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GAG AAT ATC CCC GAG TTG GCG GCT GAG CTG GTG GAG CTG GGC TTC ATT 
Glu Asn lie pro Glu Leu Ala Ala Glu Leu Val Gin Leu Gly Phe lie 
485 490 495 

AGT GAG GCT GAG GAG AGO CGG TTG ACT TCT CTG CTA GAA GAG ACC TTG 
Ser Glu Ala Asp Gin Ser Arg Leu Thr Ser Leu Leu Glu Glu Thr Leu 
500 505 510 515 

AAC AAG TTC AAT TTT GCC AGG AAC AGT ACC CTC AAC TCA GCC GCT GTC 
Asn Lys Phe Asn Phe Ala Arg Asn Ser Thr Leu Asn Ser Ala Ala Val 
520 525 530 

ACC GTC TCC TCT TAGAGCTCAC TCGGGCCAGG CCCTGATCTG CGCTGTGGCT 
Thr Val Ser Ser 
535 

GTCCCTGGAC GTGCTGCAGC CCTCCTGTCC CTTCCCCCCA GTCAGTATTA CCCTGTGAAG 
CCCCTTCCCT CCTTTATTAT TCAGGAG6GC TGGGGGGGCT CCCTGGTTCT GAGCATCATC 
CTTTCCCCTC CCCTCTCTTC CTCCCCTCTG CACTTTGTTT ACTTGTTTTG CACAGACGTG 
GGCCTGGGCC TTCTCAGCAG CCGCCTTCTA GTTGGGGGCT AGTCGCTGAT CTGCCGGCTC 
CCGCCCAGCC TGTGTGGAAA GGAGGCCCAC GGGCACTAGG GGAGCCGAAT TCTACAATCC 
CGCTGGGGCG GCCGGGGCGG GAGAGAAAGG TGGTGCTGCA GTGGTGGCCC TGGGGGGCCA 
TTCGATTCGC CTCAGTTGCT GCTGTAATAA AAGTCTACTT TTTGCTAAAA AAAAAAAAAA. 
AAAAAAAAAA A 



1662 

1710 

1758 

1810 

1870 
1930 
1990 
2050 
2110 
2170 
2230 
2241 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 535 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ser Glu Gly Glu Ser Gin Thr Val Leu Ser Ser Gly Ser Asp Pro 
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15 10 15 

Lys Val Glu Ser Ser Ser Ser Ala Pro Gly Leu Thr Ser Val Ser Pro 
20 25 30 

Pro Val Tbr Ser Thr Thr Ser Ala Ala Ser Pro Glu Glu Glu Glu Glu 
35 40 45 

Ser Glu Asp Glu Ser Glu lie Leu Glu Glu Ser Pro Cys Gly Arg Trp 
50 55 60 

Gin Lys Arg Arg Glu Glu Val Asn Gin Arg Asn Val Pro Gly lie Asp 
65 70 75 80 

Ser Ala Tyr Leu Ala Met Asp Thr Glu Glu Gly Val Glu Val Val Trp 
85 90 95 

Asn Glu Val Gin Phe Ser Glu Arg Lys Asn Tyr Lys Leu Gin Glu Glu 
100 105 110 

Lys Val Cys Ala Val Phe Asp Asn Leu He Gin Leu Glu His Leu Asn 
115 120 125 

He Val Lys Phe His Lys Tyr Trp Ala Asp He Lys Glu Asn Lys Ala 
130 135 140 

Arg Val He Phe He Thr Gly Tyr Met Ser Ser Gly Ser Leu Lys Gin 
145 150 155 160 

Phe Leu Lys Lys Thr Gin Lys Asn His Gin Thr Met Asn Glu Lys Ala 
165 170 175 

Trp Lys Arg Trp Cys Thr Gin He Leu Ser Ala Leu Ser Tyr Leu His 
180 185 190 

Ser Cys Asp Pro Pro He He His Gly Asn Leu Thr Cys Asp Thr He 
195 200 205 

Phe He Gin His Asn Gly Leu He Lys He Gly Ser Val Ala Pro Asp 
210 215 220 

Thr He Asn Asn His Val Lys Thr Cys Arg Glu Glu Gin Lys Asn Leu 
225 230 235 240 



His Phe Phe Ala Pro Glu Tyr Gly Glu Val Thr Asn Val Thr Thr Ala 
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245 250 255 

Val Asp He Tyr Ser Phe Gly Met Cys Ala Leu Gly Met Ala Val Leu 
260 265 270 

Glu He Gin Gly Asn Gly Glu Ser Ser Tyr Val Pro Gin Glu Ala He 
275 280 285 

Ser Ser Ala He Gin Leu Leu Glu Asp Pro Leu Gin Arg Glu Phe He 
290 295 300 

Gin Lys Cys Leu Gin Ser Glu Pro Ala Arg Arg Pro Thr Ala Arg Glu 
305 310 315 320 

Leu Leu Phe His Pro Ala Leu Phe Glu Val Pro Ser Leu Lys Leu Leu 
325 330 335 

Ala Ala His cys He Val Gly His Gin His Met He Pro Glu Asn Ala 
340 345 350 

Leu Glu Glu He Thr Lys Asn Met Asp Thr Ser Ala Val Leu Ala Glu 
355 360 365 

He Pro Ala Gly Pro Gly Arg Glu Pro Val Gin Thr Leu Tyr Ser Gin 
370 375 380 

Ser Pro Ala Leu Glu Leu Asp Lys Phe Leu Glu Asp Val Arg Asn Gly 
385 390 395 400 

He Tyr Pro Leu Thr Ala Phe Gly Leu Pro Arg Pro Gin Gin Pro Gin 
405 410 415 

Gin Glu Glu Val Thr Ser Pro Val Val Pro Pro Ser Val Lys Thr Pro 
420 425 430 

Thr Pro Glu Pro Ala Glu Val Glu Thr Arg Lys Val Val Leu Met Gin 
435 440 445 

Cys Asn He Glu Ser Val Glu Glu Gly Val Lys His His Leu Thr Leu 
450 455 460 

Leu Leu Lys Leu Glu Asp Lys Leu Asn Arg His Leu Ser Cys Asp Leu 
465 470 475 480 

Met Pro Asn Glu Asn He Pro Glu Leu Ala Ala Glu Leu Val Gin Leu 
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485 490 495 

Gly Phe lie Ser Glu Ala Asp Gin Ser Arg Leu Thr Ser Leu Leu Glu 
500 505 510 

Glu Thr Leu Asn Lys Phe Asn Phe Ala Arg Asn Ser Thr Leu Asn Ser 
515 520 525 

Ala Ala Val Thr Val Ser Ser 
530 535 



1 



(2) INFORMATION FOR SEQ ID NO: 11: 

(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IP^NO:^!.: 



GCACAGTCGA CCAAGCCGGA GTCGCAGAG 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:,12: 
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GCACAAAGCT TGCCAGGAGG GGTCTGGCTG 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTldfe: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GCACAACCAT GGCCAAGCCG GAGTCGCAGG AG 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SSQ ID NO: 14: 



GCACAAGATC TCCAGGAGGG GTCTGGCTG 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Lys Pro Glu Ser Gin Glu Ala Ala Pro Leu Ser Gly Pro Cys 
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5 10 
(2) INFORMATION FOR SEQ ID NO; 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Glu Asp Ala Glu Leu Gly Arg Trp Pro Trp Gin Gly Ser Leu Arg Leu Trp Asp 
5 10 15 

Cys 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Gly Tyr lie Lys Glu Asp Glu Ala Leu Pro Ser Pro His Thr Leu Gin Cys 
5 10 15 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GCACAGGTAC CGAGGCCATG GGCGCGCGC 
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(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GCACATCTAG ATCAGTGGTG GTGGTGGTGG TGGACCGGCC CCAGGAGTGG 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GCACAGCGGC CGCGAGGCCA TGGGCGCGCG C 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GCACAGCGGC CGCTCAGTGG TGGTGGTGGT GGTGCCAGGA GGGGTCTGGC TG 
(2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CTGACTTCCA TGCCATCCTT 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
GCTCACGACT CCAATCTGAT 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 

Arg lie Val Gly Gly 
5 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 959 base pairs 

(B) TYPE: nucleic acid 
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te) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2. .856 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

C GAC CTA TTG TCA GGG CCC TGC GGT CAC AGG ACC ATC CCT TCC CGT 46 
Asp Leu Leu Ser Gly Pro Cys Gly His Arg Thr He Pro Ser Arg 
15 10 15 

ATA GTG GGT GGC GAT GAT GCT GAG CTT GGC CGC TGG CCG TGG CAA GGG 94 
He Val Gly Gly Asp Asp Ala Glu Leu Gly Arg Trp Pro Trp Gin Gly 



20 



25 



30 



AGC CTG CGT GTA TGG GGC AAC CAC TTA TGT GGC GCA ACC TTG CTC AAC 
Ser Leu Arg Val Trp Gly Asn His Leu Cys Gly Ala Thr Leu Leu Asn 
35 40 45 



142 



CGC CGC TGG GTG CTT ACA GCT GCC CAC TGC TTC CAA AAG GAT AAC GAT 
Arg Arg Trp Val Leu Thr Ala Ala His Cys Phe Gin Lys Asp Asn Asp 
50 55 60 



190 



CCT TTT GAC TGG ACA GTC CAG TTT GGT GAG CTG ACT TCC AGG CCA TCT 
Pro Phe Asp Trp Thr Val Gin Phe Gly Glu Leu Thr Ser Arg Pro Ser 
65 70 75 



238 



CTC TGG AAC CTA CAG GCC TAT TCC AAC CGT TAC CAA ATA GAA GAT ATT 
Leu Trp Asn Leu Gin Ala Tyr Ser Asn Arg Tyr Gin He Glu Asp He 
80 85 90 95 



286 



TTC CTG AGC CCC AAG TAC TCG GAG CAG TAT CCC AAT GAC ATA GCC CTG 
Phe Leu Ser Pro Lys Tyr Ser Glu Gin Tyr Pro Asn Asp He Ala Leu 
100 105 110 



334 



CTG AAG CTG TCA TCT CCA GTC ACC TAC AAT AAC TTC ATC CAG CCC ATC 
Leu Lys Leu Ser Ser Pro Val Thr Tyr Asn Asn Phe He Gin Pro He 
115 120 125 



382 
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478 



526 



574 



TGC CTC CTC AAC TCC ACG TAC AAG TTT GAG AAC CGA ACT GAG TGC TGG 430 
Cys Leu Leu Asn Ser Thr Tyr Lys Phe Glu Asn Arg Thr Asp Cys Trp 
130 135 140 

GTG ACC GGC TGG GGG GCT ATT GGA GAA GAT GAG AGT CTG CCA TCT CCC 
Val Thr Gly Trp Gly Ala He Gly Glu Asp Glu Ser Leu Pro Ser Pro 
145 150 155 

AAC ACT CTC CAG GAA GTC CAG GTA GCT ATT ATC AAC AAC AGC ATG TGT 
Asn Thr Leu Gin Glu Val Gin Val Ala He He Asn Asn Ser Met Cys 
160 1S5 170 175 

AAC CAT ATG TAC AAA AAG CCA GAC TTC CGC ACG AAC ATC TGG GGA GAC 
Asn His Met Tyr Lys Lys Pro Asp Phe Arg Thr Asn He Trp Gly Asp 
180 185 190 

ATG GTT TCC GCT GGC ACT CCT GAA GGT GGC AAG GAT GCC TGC TTT GGT 622 
Met Val Cys Ala Gly Thr Pro Glu Gly Gly Lys Asp Ala Cys Phe Gly 
195 200 205 

GAC TCG GGA GGA CCC TTG GCC TGC GAC CAG GAT ACG GTG TCG TAT CAG 670 
ASP Ser Gly Gly Pro Leu Ala Cys Asp Gin Asp Thr Val Trp Tyr Gin 
210 215 220 

GTT GGA GTT GTG AGC TGG GGA ATA GGC TGT GGT CGC CCC AAT CGC CCT 
Val Gly val Val Ser Trp Gly He Gly Cys Gly Arg Pro Asn Arg Pro 
225 230 235 

GGA GTC TAT ACC AAC ATC AGT CAT CAC TAC AAC TGG ATC CAG TCA ACC 
Gly Val Tyr Thr Asn He Ser His His Tyr Asn Trp He Gin Ser Thr 
240 245 ' 250 255 

ATG ATC CGC AAT GGG CTG CTC AGG CCT GAC CCA GTC CCC TTG CTA CTG 814 
Met He Arg Asn Gly Leu Leu Arg Pro Asp Pro Val Pro Leu Leu Leu 
260 265 270 

TTT CTT ACT CTG GCC TGG GCT TCC TCT TTG CTG AGG CCT GCC 856 
Phe Leu Thr Leu Ala Trp Ala Ser Ser Leu Leu Arg Pro Ala 
275 280 285 

TCAGCCCACA CGTGTACGTC ACACCTGTGA GGTCAGGGTG TCTCTCTTTT GTATCTTGCT 916 



718 



766 



TGCTAATAAA CCTGTTAATA TTTAAAAAAA AAAAAAAAAA AAA 



959 
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(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 285 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Asp Leu Leu Ser Gly Pro Cys Gly His Arg Thr lie Pro Ser Arg lie 
15 10 15 

val Gly Gly Asp Asp Ala Glu Leu Gly Arg Trp Pro Trp Gin Gly Ser 
20 25 30 

Leu Arg Val Trp Gly Asn His Leu Cys Gly Ala Thr Leu Leu Asn Arg 
35 40 45 

Arg Trp Val Leu Thr Ala Ala His Cys Phe Gin Lys Asp Asn Asp Pro 
50 55 60 

Phe Asp Trp Thr Val Gin Phe Gly Glu Leu Thr Ser Arg Pro Ser Leu 
65 70 75 80 

Trp Asn Leu Gin Ala Tyr Ser Asn Arg Tyr Gin lie Glu Asp lie Phe 
85 90 95 

Leu Ser Pro Lys Tyr Ser Glu Gin Tyr Pro Asn Asp lie Ala Leu Leu 
100 105 110 

Lys Leu Ser Ser Pro Val Thr Tyr Asn Asn Phe He Gin Pro He Cys 
115 120 125 

Leu Leu Asn Ser Thr Tyr Lys Phe Glu Asn Arg Thr Asp Cys Trp Val 
130 135 140 

Thr Gly Trp Gly Ala He Gly Glu Asp Glu Ser Leu Pro Ser Pro Asn 
145 150 155 160 

Thr Leu Gin Glu Val Gin Val Ala He He Asn Asn Ser Met Cys Asn 
165 170 175 

His Met Tyr Lys Lys Pro Asp Phe Arg Thr Asn He Trp Gly Asp Met 
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180 



185 190 



Val Cys Ala Gly Thr Pro Glu Gly Gly Lys Asp Ala Cys Phe Gly Asp 
195 200 205 

Ser Gly Gly Pro Leu Ala Cys Asp Gin Asp Thr Val Trp Tyr Gin Val 
210 215 220 

Gly val val Ser Trp Gly He Gly Cys Gly Arg Pro Asn Arg Pro Gly 
225 230 235 240 

Val Tyr Thr Asn He Ser His His Tyr Asn Trp He Gin Ser Thr Met 
245 250 , 255 

He Arg Asn Gly Leu Leu Arg Pro Asp Pro Val Pro Leu Leu Leu Phe 
260 265 270 

Leu Thr Leu Ala Trp Ala Ser Ser Leu Leu Arg Pro Ala 
275 280 285 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3866 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
AGTGAGTCTC CTGCCTCAGC CTCCCAAGTA GCTGGGACTT CAGGTGTGTG CCACCATCCT 
CAGCTAATTT TTTTTTTTTT TTTTTTTTTG AGAAGGAGTC TTGCTCTGTC GCCCAGGCTG 
GAGTGCAGTG GCGCGATCTT CCAGGCCCCA CCGGGCCCTC AGGAAGGCCT TGCCTACCTG 
CTTTAAGGG6 ACTCCTGGCT CAGGGCCAGG CCCCTGGTGC TGGAGGAGGT GGTGGGTGGA 
GGGCAGGGGG CACCAAGCGG GCAGCCAGGA CCCCCGGGCT GCAGACAA6A AAAGGACTGT 
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GGGGTCCACC GGGTCTGGGC CACATCAAGG AATGTGGTTG AAGACCCGCC CTTAGGAGCT 
GAAAGCCAGG GCGCTACCAG GCCTGAGAGG CCCCAAACAG CCCTTGGGCC TGGTTTGGGA 
GGATTAAGCT GGAGCTCCCA ACCCGCCCTG CCCCCAGGGG GCGACCCCGG GCCCGGCGCG 
AGAGGAGGCA GAGGGGGCGT CAGGCCGCGG GAGAGGAGGC CATGGGCGCG CGCGGGGCGC 
TGCTGCTGGC GCTGCTGCTG GCTCGGGCTG GACTCAGGAA GCCGGGTGAG CTCGGGGCGC 
TGCTGGC6GG ATCGGGAGGC GGGGGAGCGG TGGGGAGGAC GGGAGGTGGA GGCCGCGGGG 
AGTCACTTCT TGTCTCCCGC AGAGTCGCAG GAGGCGGCGC CGTTATCAGG TAGGGCGCCC 
AGGACGCGCG ATTCCTGCCA GGGCCGTTGG GCCGAGGTGG ACGGGGGGCG GTGAGGGGGT 
AGAGGGGGGC CTTTACTGCT CTCTCGCCCC CGCCCCCGGG ATCGAGAACT CTGTTGGCGT 
GGAAAGTAAC TAACGGACGC TGGAGGGGGA TGGGCGGGCC CTGCAGAGCA CGTGG6AGGA 
TCTCCAGTGT CACCTACTTC CTGCTGCACA CACGCGAGGG GACCCTGGGT GGGCAAAAAC 
GTGCTTTCCC GGACGGGGTT GAAGGGGAGA AAGGGAGAGG TCGGGCTTGG GGGGCTGCCT 
CCCGCGGCTC AGCAGTTCCT CTGACCATCC GAGGACCATG CGGCCGACGG GTCATCACGT 
CGCGCATCGT GGGTGGAGAG GACGCCGAAC TCGGGCGTTG GCCGTGGCAG GGGAGCCTGC 
GCCTGTGGGA TTCCCACGTA TGCGGAGTGA GCCTGCTCAG CCACCGCTGG GCACTCACGG 
CGGCGCACTG CTTTGAAACG TGAGTGGGGG TGCGAACGGA GGGGTGCGGG GACGGGCAGG 
AACAGGGCTG GAGGGAGTGC CACCGAACTT TACCTCTGGT CTGATGCCAG ACTTGGGCGT 
GAAAGTTGTG CGTGGATGCG GCCTGGTGTT CTCCTGAGCC CCAGGCTGTG CTGCAGCCGG 
TTACACCCAC TCCAGTTCCC TTTGGGTCTC CTGGAGGGAA CCCTGTTCAG GTTATTCCAG 
AATGTTCTTC CAGAACATTT CCACACACTT TTGGGTATTC TCTCCCTTTT TCTTTCAACC 
CAAAGTTCAC CACTGACCAT CCCACCCTCA TCCCCCCTCC TGGTGGACGG TGCGGTACAG 
TGTGGG6CAC TGAGCCAAGG CCAGCACCCC C6GGCCGCTG TGTGGACTCC ATCCTGCCAA 
TCCCACATTG GCGTGGTGCA TCTCCCCATT CCTCCTTGGG CTGCATGGGG GTGCCCCTGG 



360 

.420 

540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
11-40 
1200 
1260 
1320 
13 80 
1440 
1500 
1560 
1620 
1680 
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AGGCCTTGGC TCAATGCAAG GCTCCTTGGG ACAGCTCTGG GAGGTGACAA GACCCCACCC 1740 

TTCTGCTGCA GGAGCAGGTC CTAGGACTTT GGTTGTGGTC TGTCTGGGCT CCTTCATTTC 1800 

TGCAGGGGAC CCTGGGTGTT AGCAAGTAGC AGCAACACCA CAGTTTCCCC TCCTGCACTG 1860 

GACCCCAGTT GTGCTCAGGT AGCCAGCCCT CCATCCAGGG CCCCTGACTG CTCTCTTCTC 1920 

TTCTGCCAGC TATAGTGACC TTAGTGATCC CTCCGGGTGG ATGGTCCAGT TTGGCCAGCT 1980 

GACTTCCATG CCATCCTTCT GGAGCCTGCA GGCCTACTAC ACCCGTTACT TCGTATCGAA 2040 

TATCTATCTG AGCCCTCGCT ACCTGGGGAA TTCACCCTAT GACATTGCCT TGGTGAAGCT 2100 

GTCTGCACCT GTCACCTACA CTAAACACAT CCAGCCCATC TGTCTCCAGG CCTCCACATT 2160 

TGAGTTTGAG AACCGGACAG ACTGCTGGGT GACTGGCTGG GGGTACATCA AAGAGGATGA 2220 

GGGTGAGGCT GGGGACAG6C GGGTCAGGGA GGAACTGTCT TTGTTCACCT GTTCCCCTGC 2280 

ATAGGCACAA TAGCCCCCTG CTTGGTCTGG GGGTGCAGGC TATGCCCCTC TTGCTTGCAG 2340 

TCTCTCCTCA CCTGCCAGGG CAGGGACCAA ACACCCAGTT CTCTCCCTTC CAGGGGCTGT 2400 

GGGGGCCAGA AGGAGAGTGT GAGAGGGAGG CCAGTTTGGC GCAAGCCTGT GGGTGGTGCG 2460 

GTGGTGGAGG GGTTCTGGAG GGCTT6GCGA CATAAACCTC ATACTTGGAT TTATTCCTGC 2520 

ATCTTTCCAC CTCCCCCAGT GCTCACCAAT GCCCCAGGCA TCACCAGCTT GCCCCTTCCC 2580 

CCAAGGTCTG GCTTTGGATG CTTATGTGAA CACCGTTTTA AGTTGCCTTG GCCCCTTCCT 2640 

CGGTTCCTTT TTGGCTGAGG AATCTCTCCA TGGCTGCAGG CAGGGCCATT GTTGCCATTC 2700 

TACAGATAGG GAAAGTGCGG CTGGGGGAGC TCTGACAGCT GTCCCTCCCC GGGGCCTTCT 2760 

GTGATGCTGC TGAGGGCCTC TGTTGTGCTG GGGTCTGGGT TGGAGCTGGG GGTAATGGAG 2820 

ATGAACCTGC CAGGCACAGT GGGTGCCCCA GGGCCCCCAC CCCCGCAGCC TATGCCATCC 2880 

CTCCATAGAG GGGCCTCAGG TTGCTGTCTC TCTCCTTCCC ACTATCGTCC GCACAGCACT 2940 

GCCATCTCCC CACACCCTCC AG6AAGTTCA GGTCGCCATC ATAAACAACT CTATGTGCAA 3000 

CCACCTCTTC CTCAAGTACA GTTTCCGCAA GGACATCTTT GGAGACATGG TTTGTGCTGG 3060 
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CAATGCCCAA GGCGGGAAGG ATGCCTGCTT CGTGAGTGTC CTTGCCACCA CTCCCAGCCC 3120 

AGGAAAGCAT CCTCTGTCCC TGTGCCTTAT TTGACCCTCA TGCCAACCCC GGGAGGTGGA 3180 

GACTGTTGCC CCACTCTGCA GATGCAGAAA CGGAGGCTTG GCTGCTGCCA GGGGGAGGAG 3240 

GAGGATGTGC ACCCAGTCTA CCCAGCCCCA TAGCCCTTCC CACTCTCAGC CCCTCCCCTG 3300 

CCCCACTCAC TCTGCCCCAG GCTGACCTCA GCCCCGCTGC TCCCCAGGGT GACTCAGGTG 3360 

GACCCTTGGC CTGTAACAAG AATGGACTGT GGTATCAGAT TGGAGTCGTG AGCTGGGGAG 3420 

TGGGCTGTGG TCGGCCCAAT CGGCCCGGTG TCTACACCAA TATCAGCCAC CACTTTGAGT 3480 

GGATCCAGAA GCTGATGGCC CAGAGTGGCA TGTCCCAGCC AGACCCCTCC TGGCCGCTAC 3540 

TCTTTTTCCC TCTTCTCTGG GCTCTCCCAC TCCTGGGGCC GGTCTGAGCC TACCTGAGCC 3600 

CATGCAGCCT GGGGCCACTG CCAAGTCAGG CCCTGGTTCT CTTCTGTCTT GTTTGGTAAT .3660 

AAACACATTC CAGTTGATGC CTTGCAGGGC ATTCTTCAAA AGCAGTGGCT TCATGGACAG 3720 

CTCATTCTCT CTTGTGCAGA CAGCCTGTCT GTGCCCCTGG CTCACACCCA CATCTGTTCT 3780 

6CACCATAGA ACCATCTGGT TATTTCGATC AGAAAGAGAA TTGTGTGTTG CCCAGGCTGG 3840 

TCTTGAACGC CTAGGGTGTC TCGATC 3866 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1165 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 



WO9W36054 PCT/AU98/00085 

-88- 

CTGAACCGGG TTGTGGGCGG CGAGGACAGC ACTGACAGCG AGTGGCCCTG GATCGTGAGC 60 

ATCCAGAAGA ATGGGACCCA CCACTGCGCA GGTTCTCTGC TCACCAGCCG CTGGGTGATC 120 

ACTGCTGCCC ACTGTTTCAA GGACAACCTG AACAAACCAT ACCTGTTCTC TGTGCTGCTG 180 

GGGGCCTGGC AGCTGGGGAA CCCTGGCTCT "CGGTCCCAGA AGGTGGGTGT TGCCTGGGTG 240 

GAGCCCCACC CTGTGTATTC CTGGAAGGAA GGTGCCTGTG CAGACATTGC CCTGGTGCGT 300 

CTCGAGCGCT CCATACAGTT CTCAGAGCGG GTCCTGCCCA TCTGCCTACC TGATGCCTCT 360 

ATCCACCTCC CTCCAAACAC CCACTGCTGG ATCTCAGGCT GGGGGAGCAT CCAAGATGGA 420 

GTTCCCTTGC CCCACCCTCA GACCCTGCAG AAGCTGAAGG TTCCTATCAT CGACTCGGAA 480 

GTCTGCAGCC ATCTGTACTG GCGGGGAGCA GGACAGGGAC CCATCACTGA GGACATGCTG 540 

TGTGCCGGCT ACTTGGAGGG GGAGCGGGAT GCTTGTCTGG GCGACTCCGG GGGCCCCCTC 600 

ATGTGCCAGG TGGACGGCGC CTGGCTGCTG GCCGGCATCA TCAGCTGGGG CGAGGGCTGT 660 

GCCGAGCGCA ACAGGCCCGG GGTCTACATC AGCCTCTCTG CGCACCGCTC CTGGGTG6AG 720 

AAGATCGTGC AAGGGGTGCA GCTCCGCGGG CGCGCTCAGG GGGGTGGGGC CCTCAGGGCA 780 

CCGAGCCAGG GCTCTGGGGC CGCCGCGCGC TCCTAGGGCG CAGCGGGACG CGGGGCTCGG 840 

ATCTGAAAGG CGGCCAGATC CACATCTGG?l TCTGGATCTC CGGCGGCCTC GGGC6GTTTC 900 

CCCCGCCGTA AATAGGCTCA TCTACCTCTA CCTCTGGGGG CCCGGACGGC TGCTGCGGAA 960 

AGGAAACCCC CTCCCCGACC CGCCCGACGG CCTCAGGCCC CGCCTCCAAG GCATCAGGCC 1020 

CCGCCCAACG GCCTCATGTC CCCGCCCCCA CGACTTCCGG CCCCGCCCCG GGCCCCAGCG 1080 

CTTTTGTGTA TATAAATGTT AATGATTTTT ATAGGTATTT GTAACCCTGC CCACATATCT 1140 

TATTTATTCC TCCAATTTCA ATAAA 1165 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 933 base pairs 
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(B) TYPE: nucleic acid 

(C) STRAJMDEDNESS : single ^ 

(D) TOPOLOGY: linear 

I 

(ii) MOLECULE TYPE: DNA J 

s 

I: ■ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

AATGCGGCCA CTCCAAGGAG GCCGGGAGGA TTGTGGGAGG CCAAGACACC CAGGAAGGAC 60^ 

GCTGGCCGTG GCAGGTTGGC CTGTGGTTGA CCTCAGTGGG GCATGTATGT GGGGGCTCCC 12 0| 

TCATCCACCC ACGCTGGGTG CTCACAGCCG CCCACTGCTT CCTGAGGTCT GAGGATCCCG 18a^ 

GGCTCTACCA TGTTAAAGTC GGAGGGCTGA CACCCTCACT TTCAGAGCCC CACTCGGCCT 240 

TGGTGGCTGT GAGGAGGCTC CTGGTCCACT CCTCATACCA TGGGACCACC ACCAGCGGGG 300 

ACATTGCCCT GATGGAGCTG GACTCCCCCT TGCAGGCCTC CCAGTTCAGC CCCATCTGCC 360 

TCCCAGGACC CCAGACCCCC CTCGCCATTG GGACCGTGTG CTGGGTAAAC GGGCTGGGGG 420 

TCCACTCAGG AGAGGCCCTG GCGAGTGTCC TTCAGGAGGT GGCTGTGCCC CTCCTGGACT 480 

CGAACATGTG TGAGCTGATG TACCACCTAG GAGAGCCCAG CCTGGCTGGC CAGCGCCTCA 540* 

TCCAGGACGA CATGCTCTGT GCTGGCTCTG TCCAGGGCA;^ GAAAGACTCC TGCCAGGGTG 600 

ACTCCGGGGG GCCGCTGGTC TGCCCCATCA ATGATACGTG GATCCAGGCC GGCATTGTGA 660 

GCTGGGGATT CGGCTGTGCC CGGCCTTTCC GGCCTGGTGT CTACACCCAG GTGCTAAGCT 720 

ACACAGACTG GATTCAGAGA ACCCTGGCTG AATCTCACTC AGGCATGTCT GGGGCCCGCC 780 

CAGGTGCCCC AGGATCCCAC TCAGGCACCT CCAGATCCCA CCCAGTGCTG CTGCTTGAGC 840 

TGTTGACCGT ATGCTTGCTT GGGTCCCTGT GAACCATGAG CCATGGAGTC CGGGATCCCC 900 

TTTCTGGTAG GATTGATGGA ATCTAATAAT AAA 933 

(2) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 980 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



CCTGTGGTCG 


CCCCAGGATG CTGAACCGAA TGGTGGGCGG GCAGGACACG CAGGAGGGCG 


60 


AGTGGCCCTG 


GCAAGTCAGC ATCCAGCGCA ACGGAAGCCA CTTCTGCGGG GGCAGCCTCA 


120 


TCGCGGAGCA 


GTGGGTCCTG ACGGCTGCGC ACTGCTTCCG CAACACCTCT GA6ACGTCCC 


180 


TGTACCAGGT 


CCTGCTGGGG GCAAGGCAGC TAGTGCAGCC GGGACCACAC GCTATGTATG 


240 


CCCGGGTGAG 


GCAGGTGGAG AGCAACCCCC TGTACCAGGG CACGGCCTCC AGCGCTGACG 


300 


TGGCCCTGGT 


GGAGCTGGAG GCACCAGTGC CCTTCACCAA TTACATCCTC CCCGTGTGCC 


360 


TGCCTGACCC 


CTCGGT6ATC TTTGAGACGG GCATGAACT(» CTGGGTCACT GGCTGGGGCA 


420 


GCCCCAGTGA 


GGAAGACCTC CTGCCCGAAC CGCGGATCCT GCAGAAACTC GCTGTGCCCA 


480 


TCATCGACAC 


ACCCAAGTGC AACCT6CTCT ACAGCAAAGA CACCGAGTTT GGCTACCAAC 


540 


CCAAAACCAT 


CAAGAATGAC ATGCTGTGCG CCGGCTTCGA GGA6GGCAAG AAGGATGCCT 


600 


6CAAGGGCGA 


CTCGGGCGGC CCCCTGGTGT GCCTCGTGGG TCAGTCGTGG CTGCAGGCGG 


660 


GGGTGATCAG 


CTGGGGTGAG GGCTGTGCCC GCCAGAACCG CCCAGGTGTC TACATCCGTG 


720 


TCACCGCCCA 


CCACAACTGG ATCCATCGGA TCATCCCCAA ACTGCAGTTC CAGCCAGCGA 


780 


GGTTGGGCGG 


CCAGAAGTGA GACCCCCGGG GCCAGGAGCC CCTTGAGCAG AGCTCTGCAC 


840 


CCAGCCTGCC 


CGCCCACACC ATCCTGCTGG TCCTCCCAGC GCTGCTGTTG CACCTGTGAG 


900 


CCCCACCAGA 


CTCATTTGTA AATAGCGCTC CTTCCTCCCC TCTCAAATAC CCTTATTTTA 


960 


TTTATGTTTC 


TCCCAATAAA 


980 
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CLAIMS: 

1. An isolated proteinaceous molecule involved in or associated with regulation of cell 
activity and/or viability comprising a sequence of amino acids encoded by a nucleotide 
sequence, at least a portion of which, is capable of being amplified by polymerase chain reaction 
(PGR) using the following primers: 

5' ACAGAATTCTGGGTIGTIACIGCIGCICAYTG3' [SEQ ID NO:l]; and 

5'ACAGAATTCAXIGGICacaC/GT/AXTCICC3' [SEQ ID N0:2]; 

or a complementary form of said primers. 

2. An isolated proteinaceous molecule according to claim 1 wherein said molecule is a 
serine proteinase con^rising an amino acid sequence substantially as set forth in SEQ ID N0:4 
or an amino acid sequence having at least 50% similarity thereto. 

3. An isolated proteinaceous molecule according to claim 1 wherein said molecule is a 
serine proteinase comprising an amino acid sequence substantially as set forth in SEQ ID NO:6 
or an amino acid sequence having at least 50% similarity thereto. 

4. An isolated proteinase molecule according to claim 1 wherein said molecule is a serine 
proteinase comprising an amino acid sequence substantially as set forth in SEQ ID NO: 8 or an 
amino acid sequence having at least about 50% similarity thereto. 

5. An isolated proteinaceous molecule according to claim 1 wherein said molecule is a 
serine proteinase comprising a sequence of amino acids encoded by a nucleotide sequence 
substantially as set forth in SEQ ID N0:3 or a nucleotide sequence having at least 50% 
similarity thereto or a nucleotide sequence capable of hybridising to the sequence set forfli in 
SEQ ID NO:3 under low stringency conditions at 42*'C. 
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6. An isolated proteinaceous molecule according to claim I wherein said molecule is a 
serine proteinase comprising a sequence of amino acids encoded by a nucleotide sequence 
substantially as set forth in SEQ ID NO:5 or a nucleotide sequence having at least 50% 
similarity thereto or a nucleotide sequence capable of hybridising to the sequence set forth in 
SEQ ID NO:5 under low stringency conditions at 42*'C. 

7. An isolated proteinaceous molecule according to claim 1 wherein said molecule is a 
serine proteinase comprising a sequence of amino acids encoded by a nucleotide sequence 
substantially as set forth in SEQ ID NO:7 or a nucleotide sequence having at least 50% 
similarity thereto or a nucleotide sequence capable of hybridising to the sequence set forth in 
SEQ ID N0:7 under low stringency conditions at 42*^0. 

8. An isolated proteinaceous molecule according to claun 1 wherein said molecule is a 
kinase comprising an amino acid sequence substantially as set forth in SEQ ID NO: 10 or having 
50% amino acid similarity thereto. 

9. An isolated proteinaceous molecule according to claim 1 wherein said molecule is a 
kinase comprising an amino acid sequence encoded by a nucleotide sequence substantially as 
set forth in SEQ ED N0:9 or a nucleotide sequence having at least 50% similarity thereto or a 
nucleotide sequence capable of hybridising to the nucleotide sequence set forth in SEQ ID 
N0:9 under low stringency conditions at42**C. 

10. An isolated nucleic acid molecule encoding a polypeptide wherein at least a portion of 
said nucleic acid molecule is capable of being amplified by polymerase chain reaction (PGR) 
using the following primers: 

5' ACAGAATTCTGGGTIGTIACIGCIGCICAYTG3' [SEQ ID NO:ll; and 
5'ACAGAATTCAXIGGICCICCiaGT/AXTCICC3' [SEQ ID N0:2]; 
or a complementary form of said primers. 
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11. An isolated nucleic acid molecule according to claim 10 wherein said polypeptide is a 
serine proteinase comprising an amino acid sequence substantiaUy as set forth in SEQ ID N0:4 
or an amino acid sequence having at least 50% similarity thereto. . 

12. An isolated nucleic acid molecule according to claim 10 wherein said polypeptide is a-^ 
serine proteinase conpiang an amino acid sequence substantially as set forth in SEQ ID N0:6 
or an amino acid sequence having at least 50% simUarity thereto. 

13. An isolated nucleic acid molecule according to claim 10 wherein said polypeptide is a 
serine proteinase con?)rising an amino acid sequence substantially as set forth in SEQ ID N0:8 
or an amino acid sequence having at least about 50% similarity thereto. 

14. An isolated nucleic acid molecule according to claim 10 comprising a sequente of 
nucleotides substantiaUy as set forth in SEQ ID NO:3 or a nucleotide sequence having at least 
50% similarity thereto or a nucleotide sequence capable of hybridising to the sequence set forth 
in SEQ ID NO:3 under low stringency conditions at 42"'C. 

15. An isolated nucleic acid molecule according to claim 10 comprising a sequence of 
nucleotides substantially as set forth in SEQ ID N0:5 or a nucleotide sequence having at least 
50% similarity thereto or a nucleotide sequence capable of hybridising to the sequence set forth 
in SEQ ID NO:5 under low sttingency conditioiis at 42°C. 

16. An isolated nucleic acid molecule according to claim 10 comprising a sequence of 
nucleotides substantially as set forth in SEQ ID N0:7 or a nucleotide sequence having at least 
50% similarity thereto or a nucleotide sequence capable of hybridising to the sequence set forth 
in SEQ ID N0:7 under low stringency conditions at 42''C. 

17. An isolated nucleic acid molecule according to claim 10 wherein said polypeptide is a 
kinase comprising an amino acid sequence substantially as set forth in SEQ ID NO: 10 or having 
50% amino acid similarity thereto. 
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18. An isolated nucleic acid molecule according to claim 17 comprising a sequence of 
nucleotides encoded by a nucleotide sequence substantially as set forth in SEQ ID N0:9 or a 
nucleotide sequence having at least 50% similarity thereto or a nucleotide sequence capable of 
hybridising to the nucleotide sequence set forth in SEQ ID NO:9 under low stringency 
conditions at 4T*C. 

19. An isolated serine proteinase encoded by a gene proximal to a cluster of genes of a 
mammalian chromosome. 

20. An isolated serine proteinase according to claim 19 wherein the mammalian 
chromosome is human chromosome 16pl3.3 or its equivalent in a non-human species. 

21. An isolated serine proteinase according to claim 20 wherein the gene cluster includes 
at least two genes having the nucleotide sequence as set forth in SEQ ID NO:3 or 5 or 28 or 
29 or 30 or a nucleotide sequence having at least 50% similarity to any one of SEQ ID NO: 3 
or 5 or 28 or 29 or 30 or a nucleotide sequence capable of hybridizing to any one of the 
sequences under low stringency conditions at 42^C. 

22. An isolated serine proteinase according to claim 20 wherein said serine proteinase is a 
short form of HELA2 having an amino acid sequence substantially as set forth in SEQ ID NO:4 
or an amino acid sequence having at least 50% similarity thereto. 

23. An isolated serine proteinase according to claim 20 wherein said serine proteinase is a 
long form of HELA2 having an amino acid sequence substantially as set forth in SEQ ID NO: 6 
or an amino acid sequence having at least 50% similarity thereto. 

24. An isolated serine proteinase according to claim 22 encoded by a nucleotide sequence 
substantially as set forth in SEQ ID NO:3 or a nucleotide sequence having at least 50% 
similarity thereto or a sequence capable of hybridizing to SEQ I NO:3 under low stringency 
conditions at 42°C, 
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25. An isolated serine proteinase according to claim 23 encoded by a nucleotide sequence 
substantially as set forth in SEQ ID N0:5 or a nucleotide sequence having at least 50% 
similarity thereto or a sequence capable of hybridizing to SEQ ID N0:5 under low stringency 
conditions at 42°G. 

26. An isolated nucleic acid molecule comprising a nucleotide sequence encoding a serine 
proteinase and corresponding to a gene proximal to a cluster of genes encoding serine 
proteinases. 

27. An isolated nucleic acid molecule according to claim 26 wherein the gene cluster 
includes at least two genes having the nucleotide sequence as set forth in SEQ ID N0:3 or 5 
or 28 or 29 or 30 or a nucleotide sequence having at least 50% similarity to any one of SEQ 
ID N0:3 or 5 or 28 or 29 or 30 or a nucleotide sequence capable of hybridizing to any one of 
the sequences under low stringency conditions at 42*'C. 

28. An isolated nucleic acid molecule according to claim 25 comprising a nucleotide 
sequence substantiaUy as set forth in SEQ ID NO:3 or SEQ ID N0:5 or a nucleotide sequence 
having at least about 50% similarity to either of SEQ ID N0:3 or SEQ ID N0:5 or a nucleotide 
sequence capable of hybridizmg to SEQ ID NO:3 or SEQ ID N0:5 under low stringency 
conditions at 42*^0. 

29. An isolated kinase conprising an amino acid sequence substantially as set forth in SEQ 
ID NO: 10 or an amino acid sequence having at least about 50% similarity thereto. 

30. An isolated kinase according to claim 29 encoded by a nucleotide sequence substantially 
as set forth in SEQ ID NO:9 or a nucleotide sequence having at least 50% similarity thereto or 
capable of hybridizing to SEQ ID NO:9 under low stringency conditions at 42T. 

31. A method of regulating cell activity and/or viability said method comprising contacting 
said cell with an activity and/or viability effective amount of a serine proteinase and/or kinase. 
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32. A method accxirding to claim 3 1 wherein the serine proteinase comprises a sequence of 
amino acids encoded by a nucleotide sequence, at least a portion of which, is capable of being 
amplified by polymerase chain reaction (PGR) using the following primers: 

5' ACAGAATTCTGGGTIGTIACIGCIGCICAYTG3' [SEQ ID NO:!]; and 

5'ACAGAATTCAXIGGICCICCIC/GT/AXTCICC3' [SEQ ID NO:2]; 

or a complementary form of said primers. 

33. A method according to claim 3 1 wherein the serine proteinase comprises an amino acid 
sequence substantially as set forth in SEQ ID NO:4 or an amino acid sequence having at least 
50% similarity thereto. 

34. A method according to claim 3 1 wherein the serine proteinase comprises an amino acid 
sequence substantially as set forth in SEQ ID NO:6 or an amino acid sequence having at least 
50% similarity thereto. 

35. A method according to claim 3 1 wherein the serine proteinase comprises an amino acid 
sequence substantially as set forth in SEQ ID N0:8 or an amino acid sequence having at least 
about 50% similarity thereto. 

36. A method according to claim 3 1 wherein the serine proteinase comprises a sequence of 
amino acids encoded by a nucleotide sequence substantially as set forth in SEQ ID NO:3 or a 
nucleotide sequence capable of hybridising to tte sequence set forth in SEQ ID N0:3 under low 
stringency conditions at 42**C. 

37 . A method according to claim 3 1 wherein the serine proteinase comprises a sequence of 
amino acids encoded by a nucleotide sequence substantially as set forth in SEQ ID NO:5 or a 
nucleotide sequence having at least 50% similarity thereto or a nucleotide sequence capable of 
hybridising to the sequence set forth in SEQ ID N0:5 under low stringency conditions at 42°C. 
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38. A method according to claim 31 wherein the serine proteinase comprises a sequence of 
amino acids encoded by a nucleotide sequence substantially as set forth in SEQ ID NO:7 or a 
nucleotide sequence having at least 50% similarity thereto or a nucleotide sequence capable of 
hybridising to the sequence set forth in SEQ ID N0:7 under low stringency conditions at 42°C. 

39. A method according to claim 3 1 wherein the kinase comprises an amino acid sequence 
substantially as set forth in SEQ ID NO:10 or having 50% amino acid similarity thereto. 

40. Amethod according to claim 31 wherein the kinase comprises an amino acid sequence 
encoded by a nucleotide sequence substantially as set forth in SEQ ID N0:9 or a nucleotide 
sequence having at least 50% similarity thereto or a nucleotide sequence capable of hybridising 
to the nucleotide sequence set forth in SEQ ID N0:9 under low stringency conditions at 42-C. 

41. A method of modulating fertility in a mammal said method comprising modulating levels 
of HELA2 whereinincreasing levels of HELA2 facilitates sperm maturation and development 

42. A method according to claim 41 wherein fertility is enhanced by introducing 
recombinant HELA2. 

43. A method according to claim 41 wherein fertility is reduced by down regulating 
expression of the HELA2 gene. 

44. A oompsition comprising a serine proteinase and/or kinase capable of regulating cell 
activity and/or viability and one or more pharmaceuticaUy acceptable carriers and/or diluents. 

45. A composition according to claim 44 wherein the serine proteinase is HELA2 or a 
functional derivative tiiereof . 

46. An isolated antibody capable of interacting with a proteinaceous molecule involved in 
or associated with regulation of ceU activity and/or viability comprising a sequence of amino 
acids encoded by a nucleotide sequence, at least a portion of which, is capable of being 
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amplified by polymerase chain reaction (PGR) using the following primers: 

5' ACAGAATTCTGGGTIGTIACIGCIGCICAYTG3' [SEQ ID NO:l]; and 
5'ACAGAATTCAXIGGICCICCIC/GT/AXTCICC3' [SEQ ID NO:21; 
or a complementary form of said primers. 

47. An isolated antibody according to claim 46 wherein said proteinaceous molecule is a 
serine proteinase comprising an amino acid sequence substantially as set forth in SEQ ID NO:4 
or an amino acid sequence having at least 50% similarity thereto. 

48. An isolated antibody according to claim 46 wherein said proteinaceous molecule is a 
serine proteinase comprising an amino acid sequence substantially as set forth in SEQ ID NO:6 
or an amino acid sequence having at least 50% similarity thereto. 

49. An isolated antibody according to claim 46 wherein said proteinaceous molecule is a 
serine proteinase connprising an amino acid sequence substantially as set forth in SEQ ID NO:8 
or an amino acid sequence having at least about 50% similarity thereto. 

50. An isolated antibody according to claini 46 wherein said proteinaceous molecule is a 
serine proteinase comprising a sequence of amino acids encoded by a nucleotide sequence 
substantially as set forth in SEQ ID NO:3 or a nucleotide sequence having at least 50% 
similarity thereto or a nucleotide sequence capable of hybridising to the sequence set forth in 
SEQ ID N0:3 under low stringency conditions at 42°C. 

51. An isolated antibody according to claim 46 wherein said proteinaceous molecule is a 
serine proteinase comprising a sequence of amino acids encoded by a nucleotide sequence 
substantially as set forth in SEQ ID NO:5 or a nucleotide sequence having at least 50% 
similarity thereto or a nucleotide sequence capable of hybridising to the sequence set forth in 
SEQ ID NO:5 under low stringency conditions at 42'C. 
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52. An isolated antibody according to claim 46 wherein said proteinaceous said molecule 
is a serine proteinase comprising a sequence of amino acids encoded by a nucleotide sequence 
substantially as set forth in SEQ ID N0:7 or a nucleotide sequence havmg at least 50% 
similarity thereto or a nucleotide sequence capable of hybridising to the sequence set forth in 
SEQ ID N0:7 under low stringency conditions at 42°C. 

53. An isolated antibody according to claim 46 wherein said proteinaceous molecule is a 
kinase comprising an amino acid sequence substantiaUy as set forth in SEQ p NO: 10 or having 
50% amino acid similarity thereto. 

54. An isolated antibody according to claim 46 wherein said proteinaceous molecule is a 
kinase comprising an amino acid sequence encoded by a nucleotide sequence substantially a& 
set forth in SEQ ID NO:9 or a nucleotide sequence having at least,50% similarity thereto or a 
nucleotide sequence capable of hybridising to tiie nucleotide sequence set fortii in SEQ ID 
NO:9 under low stiingency conditions at 42°C. 

55 . An antagonist or agonist to the isolated proteinaceous molecule according to any one 
of claims 1 to 9. 

56. A method of deteimining a predisposition for or the presence of a cancer, said method 
comprising determining the presence of a nucleotide sequence encoding a proteinaceous 
molecule according to any one of claims 1 to 9. 

57. A method according to claim 56 wherein the nucleotide sequence encodes a 
polypeptide wherein at least a portion of said nucleotide sequence is capable of being amplified 
by polymerase chain reaction (PGR) using tiie following primers: 

5' ACAGAA1TCTGGGTIGTIACIGCIGCICAYTG3' [SEQ ID NO:i]; and 



5'ACAGAATTCAXIGGICCICX3CyGT/AXTCICC3' [SEQ ID NO:2]; 
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or a complementary form of said primers. 

58. A method according to claim 57 wherein said nucleotide sequence encodes a serine 
proteinase comprising an amino acid sequence substantially as set forth in SEQ ID N0:4 or an 
amino acid sequence having at least 50% similarity thereto. 

59. A method according to claim 57 wherein said nucleotide sequence encodes a serine 
proteinase comprising an amino acid sequence substantially as set forth in SEQ ID NO:6 or an 
amino acid sequence having at least 50% similarity thereto. > 

60. A method according to claim 57 wherein said nucleotide sequence encodes a serine 
proteinase con^rising an amino acid sequence substantially as set forth in SEQ ID N0:8 or an 
amino acid sequence having at least about 50% similarity diereto. 

61. A method according to claim 57 wherein said nucleotide sequence is as substantiaUy set 
forth in SEQ ID N0:3 or is a nucleotide sequence having at least 50% similarity thereto or a 
nucleotide sequence capable of hybridising to the sequence set forth in SEQ ID N0:3 under low 
stringency conditions at 42°C. 

62. A medKxi according to claim 57 wherein said nucleotide sequence is as substantially set 
forth in SEQ ID NO:5 or a nucleotide sequence having at least 50% similarity thereto or a 
nucleotide sequence capable of hybridising to the sequence set forth in SEQ ID N0:5 under low 
stringency conditions at 42°C. 

63. A method according to claim57 wherein said nucleotide sequence is as substantially set 
forth in SEQ ID N0:7 or a nucleotide sequence having at least 50% similarity thereto or a 
nucleotide sequence capable of hybridising to the sequence set forth in SEQ ID N0:7 under low 
stringency conditions at 42''C. 

64. A method according to claim 57 wherem said nucleotide sequence is as substantially set 
forth in SEQ ID NO: 10 or having 50% amino acid similarity tiiereto. 
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65. A method according to claim 57 wherein said nucleotide sequence is as substantially set 
forth in SEQ ID NO:9 or a nucleotide sequence having at least 50% similarity thereto or a 
nucleotide sequence capable of hybridising to the nucleotide sequence set forth in SEQ ID 
N0:9 under low stringency conditions at 42''C. 
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EXON III CACTGCTTTGAAACgtgagtgggggtgcgaacggag 
gggtgcggggacgggcaggaacagggctggagggagtgccaccga 
actttacctctggtctgatgccagacttgggcgtgaaagttgtgc 
gtggatgcggcctggtgttctcctgagccccaggctgtgctgcag 
ccggttacacccactccagttccctttgggtctcctggagggaac 
cctgttcaggttattccagaatgttcttccagaacatttccacac 
acttttgggtattctctccctttttctttcaacccaaagttcacc 
actgaccatcccaccctcatcccccctcctggtggacggtgcggt 
acagtgtggggcactgagccaaggccagcacccccgggccgctgt 

INTRON C (716 BP) 

gtggactccatcctgccaatcccacattggcgtggtgcatctccc 
cattcctccttgggctgcatgggggtgcccctggaggccttggct 
caatgcaaggctccttgggacagctctgggaggtgacaagacccc 
acccttctgctgcaggagcaggtcctagactttggttgtggtctg 
tctgggctccttcatttctgcaggggaccctgggtgttagcaagt 
agcagcaacaccacagtttcccctcctgcactggaccccagttgt 
gctcaggtagccagccctccatccagggcccctgactgctctctt 
ctcttctgccagctatagTGACCTTAGTGATCCC EXON IV 



ALTERNATIVE ^ 
SPLICING EVENTS ^--^^^^ 

SHORT ISOFORM LONG ISOFORM 

cactgctttg'aaactgaccttagtgatccc 

H C F E T D L S D P 



CACTGCTTT6AAACCTATA6TGACCTTAGTGATCCC 
HCFETYSDLSDP 

RGURE17 
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FIG 18 (AI) 



FIG 18 (AID 



FIG 18(A) 
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FIG 20A(AI) 



FIG 20A(AII) 



FIG 20A(AIII) 



FIG 20A(A) 
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